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FOREWORD 


INTRODUCTION  TO  “COMBAT  SYSTEMS” 


Combat  systems  are  the  integrated  systems  that  give  modern  military  units  their  enormous 
warfighting  (and  peacekeeping)  potential.  In  the  current  context,  combat  systems  include  almost 
everything  on  board  modern  platforms  (ships,  submarines,  ground  vehicles,  aircraft,  or  spacecraft 
are  platforms)  except  those  related  to  structure,  mobility,  and  habitability  (if  the  platform  is 
manned).  The  elements  that  make  up  modern  combat  systems  include  the  sensor  systems  that 
detect  and  track  the  targets,  the  weapons  that  engage  the  targets,  the  command  & control  systems 
that  authorize  or  assist  the  engagement  of  the  targets,  and  anything  else  needed  to  integrate  the 
elements  into  a platform  and  permit  the  operator  to  use  and  control  them. 

This  set  of  books  evolved  from  notes  used  in  two  one-quarter  courses  to  teach  the  elements 
of  combat  systems  and  systems  integration  to  military  officers  in  the  Total  Ship  Systems  Engineer- 
ing (TSSE)  program  at  the  Naval  Postgraduate  School  (NPS)  in  Monterey,  California.  Students  in 
this  multidisciplinary  program  are  drawn  from  three  curricula  at  NPS:  Naval  & Mechanical 
Engineering,  Electrical  Engineering,  and  Combat  Systems  Science  & Technology.  The  students 
take  six  elective  courses  (1  on  systems  engineering,  1 on  shipboard  power  systems,  1 on  naval 
architecture,  1 on  hull,  mechanical,  & electrical  integration,  and  2 on  combat  systems  elements  and 
integration)  and  perform  a six-month  capstone  ship  design  project  in  addition  to  fulfilling  all 
requirement  for  M.  S.  degrees  in  Mechanical  Engineering,  Electrical  Engineering,  or  Applied 
Physics,  respectively.  The  students  are  typically  junior  naval  officers  (mostly  0-3  - Lieutenants) 
who  have  served  at  least  two  tours  aboard  ship  and  have  been  given  the  opportunity  to  obtain 
graduate  technical  educations  prior  to  their  next  promotions.  The  majority  of  these  are  Engineering 
Duty  Officers,  although  a number  have  been  Unrestricted  Line  Officers.  However,  the  classes  have 
also  had  a number  of  Coast  Guard  officers,  and  even  a few  Marine  Corps  and  Army  officers  and 
Department  of  Defense  civilians.  Although  most  of  the  students  were  interested  in  Naval  surface 
ships  and  came  from  the  Surface  Warfare  community,  a number  were  aviators,  submariners,  space 
systems  operators,  and  even  armor  officers.  This  fact  coupled  with  the  author’s  belief  in  the 
necessity  of  joint  warfare  led  him  to  tailor  the  course  to  avoid  restriction  of  the  discussions  to 
“Surface  Navy  only”  systems.  A Systems  Engineering  and  Analysis  (SEA)  program  was  developed 
a few  years  ago  to  teach  technology  to  future  requirements  setters  in  the  Unrestricted  Line 
Community.  The  two  TSSE  courses  on  combat  systems  were  expanded  to  four  courses  in  the  SEA 
curriculum,  necessitating  the  incorporation  of  more  material  into  the  texts. 

In  recent  years  a small  number  of  foreign  military  officers  have  been  allowed  to  enter  the 
program.  This  required  that  the  course  materials  be  unclassified  and  available  for  unlimited 
distribution.  This  was  facilitated  by  taking  a first  principles  approach  to  combat  systems  rather  than 
dissecting  existing  systems.  The  laws  of  nature  are  open  to  all  who  observe  them  and  there  is  little 
in  any  generic  sensor  or  weapon  that  can  be  classified.  By  being  forced  to  avoid  discussing  details 
of  existing  hardware,  we  gained  the  advantage  of  avoiding  a focus  on  the  present.  We  can  address 
the  basic  principles  that  govern  the  performance  of  all  systems:  old,  new,  and  yet-to-be-developed. 
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Another  consideration  in  the  design  of  the  courses  was  the  diversity  of  the  students. 
Because  the  students  come  from  vastly  different  undergraduate  backgrounds,  the  only  commonality 
that  could  be  assumed  was  the  curriculum  requirements  of  having  taken  one-year  of  calculus  and 
one-year  of  calculus-based  physics  at  the  undergraduate  level.  However,  these  courses  have 
typically  been  taken  by  the  students  at  least  five  to  eight  years  prior  to  their  beginning  the  TSSE 
program  and  the  skills  taught  in  those  classes  have  seldom  been  required  to  perform  their  interven- 
ing military  jobs.  This  meant  that  all  necessary  technical  background  had  to  be  contained  within  the 
courses  themselves.  Another  consideration  was  the  fact  that  the  courses  were  intended  to  educate 
systems  engineers  in  sufficient  detail  to  be  able  to  interact  with  element  designers,  not  to  educate 
element  designers  in  the  intricacies  of  their  craft.  However,  based  on  the  author’s  fifteen  years  as 
an  element  designer  followed  by  fifteen  more  years  as  a systems  engineer  (element  integrator),  it 
was  decided  that  excess  design-level  information  was  preferable  to  insufficient  design-level 
information.  Shortcomings  in  the  author’s  own  breadth  of  knowledge  will  guarantee  less  than 
complete  detail  in  more  a than  a few  of  the  topics  to  be  covered.  Nevertheless,  even  experienced 
element  designers  may  find  organizing  principles  and  a philosophical  framework  connecting  their 
specialty  to  other  fields  in  ways  that  they  may  not  have  consciously  recognized  from  their  own 
experience. 

This  material  is  taught  as  part  of  a broader  systems  engineering-oriented  curriculum. 
Although  this  is  not  a firm  requirement  - the  material  contained  in  these  volumes  is  relatively  self- 
contained  - the  author  feels  that  a systems  engineering  approach  to  the  subject  of  combat  systems 
is  essential  for  successful  utilization  of  the  principles  presented  here.  In  both  of  the  curricula 
currently  using  this  material,  the  students  take  at  least  one  course  in  systems  engineering  principles 
and  processes.  This  course  may  be  taken  prior  to  or  concurrent  with  the  course  that  covers  the 
material  in  the  first  two  volumes  of  this  text.  The  systems  engineering  process  that  is  taught  is 
captured  in  Figure  F-1.  To  facilitate  internalization  of  this  overall  process,  the  combat  systems 
courses  attempt  to  emphasize  those  aspects  that  might  impact  requirements,  functional  capabilities, 
concept  synthesis,  and  performance  analysis. 

The  TSSE  program  was  established  because  the  Navy  realized  that  combat  systems  and  ship 
hull,  mechanical,  and  electrical  (HM&E)  design  had  become  stovepiped  with  combat  systems  and 
HM&E  engineering  being  in  separate  stovepipes.  Requirements  entered  the  bottom  of  the 
stovepipes  and  systems  came  out  the  top.  However,  there  was  no  interconnections  along  the  length 
of  the  stovepipes.  The  top  was  the  first  time  that  combat  systems  interacted  with  HM&E.  As  a 
result,  there  were  interface  problems  and  inefficiencies.  TSSE  was  intended  to  provide  cross 
channels  between  the  stovepipes  to  prevent  the  standard  problems  that  arose.  TSSE  appears  to  have 
been  very  successful  in  doing  this. 

However,  the  author  has  long  recognized  that  it  is  not  just  combat  systems  and  HM&E  that 
suffer  from  suffer  from  stovepipe  problems.  Sensors,  weapons,  and  command  & control  have  there 
own  stovepipes.  Each  of  these  is  composed  of  other  stovepipes  and  few  have  any  significant 
interactions.  In  sensors,  radar  is  taught  as  a completely  different  subject  area  from  sonar.  Radar 
engineers  seldom  know  much  of  anything  about  sonar  and  vice  versa.  And  both  radar  and  sonar  are 
taught  as  a completely  different  subject  areas  from  electro-optical  sensors  and  field  sensors.  The 
author  recognizes  that  this  is  not  only  unnecessary,  but  also  counter-productive,  at  any  level,  but 
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Figure  F-1.  The  systems  engineering  process. 


especially  at  the  systems  engineering  level.  Combat  systems  should  not  be  taught  or  practiced  as 
a set  of  stovepipes,  but  as  an  integrated  discipline.  The  key  is  in  recognizing  the  many  common 
aspects  of  these  systems  elements  and  teaching  them  once,  and  then  pointing  out  the  individual 
areas  where  these  common  elements  may  be  applied,  and  providing  the  necessary  language  to 
describe  and  use  these  common  elements  in  the  same  fashion  as  taught  individually  by  the 
stovepiped  disciplines.  This  will  be  illustrated  further  as  we  proceed  through  the  foreward. 

Breadth  and  depth  are  required  to  be  an  outstanding  systems  engineer  - this  is  also  true  of 
a combat  systems  engineer.  Systems  engineers  need  to  be  able  to  perform  trades  between  complex 
alternatives.  This  requires  at  a minimum  the  ability  to  perform  “back  of  the  envelope”  calculations 
of  sensor,  weapon,  and  communication  performance.  Since  few  teams  can  afford  to  have  dedicated 
engineers  on-hand  to  cover  every  possible  specialty  discipline,  systems  engineers  are  often  called 
to  “fill-in”  and  address  whatever  tasks  cannot  be  accommodated  by  the  rest  of  the  team.  In  his 
career  as  a systems  engineer  for  a defense  contractor,  the  author  was  at  different  times  required  to 
perform  reliability  predictions,  nuclear  hardness  analysis,  maintainability  assessments,  threat 
assessments,  operations  analyses,  fault  analyses,  safety  assessments,  development  of  pre-planned 
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product  improvement  strategies,  manpower  planning,  establishment  of  embedded  training 
requirements,  and  conceptual  designs  of  weapon  components  outside  his  area  of  expertise.  The 
author’s  original  specialty  was  lasers  and  their  applications  - yet  he  became  involved  in  designing 
fuzes,  propulsion  schemes,  signal  processing  techniques,  seekers,  warheads,  launchers,  and  power 
supplies,  among  others,  in  addition  to  all  of  the  normal  activities  of  systems  engineering.  The 
requirement  of  breadth  and  depth  is  further  supported  both  by  the  need  of  systems  engineers  to 
know  the  value  and  potential  contributions  of  every  specialty  discipline  involved  in  complex  design 
tasks  (to  insure  that  they  are  considered  equally  in  the  overall  design)  as  well  as  the  need  to  know 
enough  of  the  jargons  of  those  specialty  disciplines  to  be  able  to  communicate  as  near  peers  with 
the  specialty  experts.  The  system  engineer  is  the  final  technical  arbiter  of  disputes  between 
specialty  disciplines.  Specialty  engineers  are  much  more  likely  to  accept  such  arbitration  if  they 
feel  the  arbiter  understands  their  point  of  view  and  their  problems  and  is  therefore  capable  of 
making  enlightened  technical  decisions.  It  is  the  author’s  belief  that  one  cannot  be  a systems 
engineer  without  broad  and  deep  knowledge  of  the  domain  in  which  that  systems  engineering  is 
being  carried  out. 

Performing  trade  studies  of  widely  differing  technical  solutions  and  synthesizing  truly 
innovative  concepts  requires  the  systems  engineer  to  collect  and  evaluate  data  from  highly  diverse 
sources.  It  is  quite  common  for  the  multiple  authors  of  that  data  to  have  used  widely  differing 
terminology  to  describe  basically  the  same  thing.  For  example,  in  the  collection  of  microwave 
attenuation  data  one  group  may  quote  their  results  in  terms  of  loss  tangents,  another  in  terms  of 
complex  refractive  indices,  and  a third  in  terms  of  extinction  coefficients.  A “Rosetta  Stone”  is 
needed  to  help  translation  of  the  different  terminology  into  a common  language.  Good  systems 
engineers  have  created,  inherited,  or  otherwise  acquired  a number  of  such  Rosetta  Stones  during 
their  career.  In  the  present  text,  the  author  makes  every  attempt  to  present  the  important  Rosetta 
stones  that  he  has  acquired  or  developed. 

Recognizing  that  there  is  a common  foundation  to  diverse  technologies  and  that  common 
formalisms  can  be  used  to  analyze  seemingly  different  problems  aids  in  recognizing  the  need  for 
using  a Rosetta  Stone  when  available.  In  the  sensor  sections,  the  author  has  explicitly  tried  to 
define  and  present  the  minimum  number  of  methodologies  needed  to  address  all  known  and 
hopefully  knowable  sensor  technologies.  Once  the  methodologies  are  mastered  and  applied  to  one 
technology,  the  student  is  90%  of  the  way  to  understanding  all  “analogs”  of  that  technology.  For 
example,  once  one  has  learned  basic  detection  theory,  estimation  theory,  and  the  essentials  of 
propagation  (attenuation,  diffraction,  refraction,  scattering,  and  interference),  and  has  applied  them 
to  developing  an  understanding  of  radar,  there  is  no  longer  any  need  to  spend  comparable  effort  in 
developing  an  understanding  of  sonar.  Virtually  every  phenomenon  affecting  radar  has  a 
counterpart  affecting  sonar.  An  understanding  of  “radar”  coupled  with  learning  the  domain  specific 
terminology  of  sonar,  permits  an  equivalent  understanding  of  “sonar”.  The  converse  is  also  true. 
Since  argument  by  analogy  is  a major  skill  that  successful  systems  engineers  possess  and  use,  we 
present  the  student  with  analogy  after  analogy.  This  is  done  not  only  to  demonstrate  the  skill,  but 
also  to  condense  an  otherwise  enormous  volume  of  seemingly  unrelated  material  (and  having 
considerable  duplication  in  the  author’s  view)  into  a manageable  set  of  courses. 


At  points  in  the  presentation  the  author  makes  strong  points  concerning  problems  that 
system  procurers  often  encounter.  A certain  amount  of  cynicism  may  be  detected  here.  The  author 
has  been  given  too  many  reasons  not  to  be  slightly  cynical.  He  seen  (and  even  worse,  experienced) 
the  cancellation  of  too  many  programs  prior  to  their  completion.  In  some  instances,  the  cancella- 
tions came  about  because  of  inadequacy  of  the  technology;  in  others,  due  to  inadequacy  of  the 
program  management.  In  still  others,  there  were  attempts  to  go  against  political  expediency.  The 
systems  engineer  on  a program  is  seldom  the  highest  authority  on  that  program  and  cannot  control 
or  even  influence  all  of  these  factors.  However,  he  can  help  guarantee  that  cancellation  is  not  due 
to  engineering  incompetence.  Where  there  are  lessons  to  be  learned  from  past  mistakes  (and  since 
the  author  considers  himself  to  be  an  expert  - experts  are  defined  as  those  who  have  already  made 
every  mistake  that  novices  are  likely  to  make  - he  has  made  many  himself)  those  lessons  should  be 
shared.  In  studying  “lessons  learned”,  a little  cynicism  is  often  justified. 

Another  aspect  of  philosophical  outlook  that  shows  up  in  this  book  stems  from  the  author’s 
intermittent  involvement  with  the  “electronic  warfare”  community.  He  has  been  consistently 
impressed  by  the  ingenuity  with  which  countermeasures  experts  can  find  ways  to  negate  the 
performance  of  “flawlessly  designed”  military  systems.  This  is  not  universally  true  of  systems 
designers.  Many  are  blissfully  unaware  of  the  existence  of  a countermeasures  community.  There 
is  also  a strong  tendency  for  design  engineers  to  be  enamored  of  the  technologies  with  which  they 
work.  This  leads  to  an  unwillingness  to  closely  examine  the  shortcomings  of  those  technologies. 
To  prevent  certain  disaster,  the  systems  engineer  must  not  also  share  this  trait.  At  the  same  time  as 
he  is  providing  counsel  and  guidance  to  making  a system  work,  the  system  engineer  should  also  be 
thinking  “How  can  I break  it?”.  Someone  working  for  our  adversaries  is  certainly  asking  that 
same  question.  Countermeasures,  vulnerability,  and  survivability  find  frequent  and  serious 
discussion  throughout  these  volumes. 

Lastly,  this  is  not  merely  a textbook  that  condenses  a well-established  body  of  knowledge. 
Nor  is  it  a handbook  that  collects  the  varied  works  and  thoughts  of  others.  Although  it  is  a textbook 
that  presents  a condensed  body  of  knowledge  and  it  was  designed  to  provide  much  of  the  utility  of 
a handbook,  it  also  contains  considerable  original  material  that  the  author  has  not  found  the  time 
or  place  to  publish  before.  Many  derivations  are  the  author’s  (references  to  the  original  source  are 
given  when  the  author  is  merely  following  a standard  derivation)  as  are  many  of  the  tables  and 
figures  presented.  Finally,  the  author  has  made  an  attempt  to  present  a coherent  philosophy  and 
comprehensive  presentation  of  combat  systems.  Some  of  the  original  material  stems  from  this 
effort.  He  hopes  the  following  pages  reflect  at  least  partial  success  in  that  attempt. 

In  summary,  this  work  attempts  to  present  a coherent  approach  to  combat  systems  by 
making  maximum  use  of  “universal”  tools  and  techniques.  These  include  Rosetta  stones,  analogies, 
common  formalisms,  lessons  learned,  and  “How  can  I break  it?”,  combined  with  considerable 
original  material  to  unify  prior  work  into  a comprehensive  presentation  of  the  subject. 

Figure  F-2  portrays  the  big  picture  of  combat  systems  that  this  work  is  attempting  to 
document.  A fully  functioning,  integrated  combat  system  will  consist  of  sensors  (often  referred 
to  as  detection  elements),  weapons  (often  referred  to  as  engagement  elements),  and  command 
& control  systems  (often  referred  to  as  control  elements)  skillfully  integrated  into  a platform  in 


XXIX 


Figure  F-2.  Integrated  combat  systems  - the  “Big  Picture”. 


such  a way  as  to  optimize  the  performance  of  the  whole.  The  term  sensors  covers  a wide  variety  of 
devices  ranging  from  the  simple  and  minute  (such  as  a simple  telescope)  to  the  enormous  and 
complex  (such  as  a phased  array  microwave  radar).  The  systems  engineer  needs  to  be  able  to  select 
from  this  variety,  compare  the  performance  of  diverse  sensors  on  a logical  and  quantifiable  basis, 
and  combine  the  outputs  from  multiple  sensors  to  perform  complex  warfighting  functions. 
Experience  has  taught  that  performing  this  task  requires  two  kinds  of  knowledge:  the  factors  that 
influence  sensor  performance  and  the  technologies  relevant  to  specific  sensor  systems. 

Performance  influences  include  the  characteristics  of  targets  that  provide  observables 
capable  of  being  sensed,  the  fundamental  characteristics  of  the  signal  (e.g.,  the  unique  characteris- 
tics of  electromagnetic  radiation),  weather  and  anomalies  in  the  propagation  of  the  signal,  the 
explicit  flow  of  energy  from  the  target  to  sensor  taking  losses  and  diffusion  into  account,  and  the 
specific  functions  being  performed  (e.g.,  detection  or  tracking).  Many  of  these  influences  affect 
many  different  sensors  in  the  same  way.  For  example,  ducting  (a  propagation  anomaly)  can  occur 
for  most  sensor  types  including  both  electromagnetic  and  acoustic  sensors.  The  results  of  detection 
theory  can  be  applied  equally  to  all  sensors  with  a detection  function.  Some  affect  different  sensors 
quite  differently.  For  example,  weather  minimally  affects  many  radars,  dramatically  affects  electro- 
optical  sensors,  and  only  indirectly  affects  sonar  systems.  Understanding  these  performance 
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influences  is  essential  to  effective  design  and  use  of  sensor  systems.  It  is  helpful  to  have  this 
understanding  before  studying  the  sensors  themselves. 

Nevertheless,  of  equal  importance  are  the  technologies  that  are  used  to  implement  the 
sensor  systems.  One  cannot  make  decisions  about  the  utility  or  lack  of  utility  of  a radar  if  one 
doesn’t  know  what  a radar  is.  In  military  applications  we  must  be  concerned  about  sensors  ranging 
from  radars  (microwave  radars,  laser  radars,  synthetic  aperture  radars,  bistatic  radars,  over-the- 
horizon  radars,  etc.)  to  infrared  sensors  (thermal  imagers,  infrared  search  and  track,  reticle  seekers) 
to  electro-optical  sensors  (television,  image  intensifiers,  direct  vision)  to  acoustic  sensors  to  radio 
frequency  intelligence  sensors  to  other  diverse  kinds  of  sensors  (radiation  monitors,  star  trackers, 
magnetic  anomaly  detectors,  etc.).  Many  of  the  concepts  used  in  one  kind  of  sensor  find  analogs 
in  other  kinds  of  sensor.  For  example,  there  is  no  fundamental  difference  between  phased  array 
antennas  in  radar  systems  and  phased  array  transducers  in  sonar  systems.  It  is  surprising  how  many 
instances  of  similar  analogies  can  be  identified. 

Sensors  are  an  essential  part  of  a total  combat  system.  It  is  difficult  to  employ  weapons  if 
the  targets  cannot  be  located.  On  the  other  hand,  without  weapons,  located  targets  cannot  be 
attacked.  Sensors  not  only  locate  targets,  they  are  integral  to  the  performance  of  the  weapons 
themselves.  Without  seekers  or  fuzes  or  navigation  systems,  most  weapons  would  not  be  effective. 
This  is  why  sensors  are  emphasized  first.  However,  the  systems  engineer  must  understand  all 
aspects  of  the  problem.  Understanding  of  weapons  also  requires  two  kinds  of  knowledge:  the 
means  by  which  weapons  accomplish  their  function  (effectors)  and  the  means  by  which  the 
effectors  are  delivered  to  the  targets  (delivery  systems).  Effectors  include  directed  energy  and  the 
means  to  produce  and  direct  it,  explosives  and  their  use  in  warheads,  kinetic  energy  weapons, 
weapons  of  mass  destruction  (WMD  - nuclear,  chemical,  and  biological  weapons),  and  even 
nonlethal  weapons.  Delivery  systems  include  guns,  bombs,  and  missiles.  Delivery  systems  of  all 
kinds  can  be  understood  by  understanding  a few  basic  elements.  These  elements  include  basic 
aerodynamics  (the  forces  that  act  on  bodies  in  motion),  control  systems  (the  way  in  which  feedback 
systems  respond  to  commands  and  external  disturbances),  guidance  (determination  of  desired 
motional  characteristics),  actuation  (transformation  of  desired  motion  into  actual  motional 
characteristics),  propulsion  (the  means  of  acceleration  objects  to  high  speeds  and/or  maintaining 
those  speeds),  and  ballistics  (the  motions  of  projectiles).  Special  characteristics  of  WMD  delivery 
systems  also  need  discussion. 

Data  must  be  exchanged  between  sensors  and  weapons  in  order  to  perform  the  desired 
missions.  This  is  one  of  the  functions  of  the  control  elements.  Communications  systems  permit 
data  to  be  exchanged  between  separate  system  elements  as  well  as  between  platforms.  In  one  sense 
communications  links  are  very  similar  to  sensor  systems  and  their  analysis  shares  many  things  in 
common  with  sensor  analysis.  Communication  systems  are  often  organized  into  networks  to 
permit  easy  sharing  of  important  data.  Computers  store  and  process  data  and  convert  data  into 
elements  of  knowledge  that  may  be  used  to  guide  future  actions.  Modern  computer  systems  use  a 
variety  of  specialized  processing  elements.  These  elements  perform  special  kinds  of  arithmetic  and 
memory  & recall  functions  and  are  assembled  and  organized  in  many  different  ways  to  form  what 
is  commonly  referred  to  as  computers.  The  organization  of  these  elements  is  called  computer 
architecture.  Architecture  is  also  used  to  describe  the  way  in  which  communications  systems  are 
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organized  as  well  as  the  way  in  which  complete  weapons  systems  are  organized  and  interconnected. 
It  is  a critical  topic  to  be  assimilated  and  internalized.  Complete  weapons  systems  must  also 
accommodate  the  specialized  systems  used  for  navigation  and  for  intelligence  gathering, 
dissemination,  and  analysis.  Since  all  systems  still  require  human  beings  to  perform  critical 
decision  functions,  it  is  important  to  understand  aspects  of  human/machine  interfaces.  This  field 
addresses  subjects  such  as  displays,  symbology,  input/output  devices,  and  controls  (switches, 
knobs,  etc.)  and  how  to  design  such  interfaces  to  maximize  information  transfer  and  minimize 
operator  error. 

Sensors,  weapons,  and  control  elements  do  not  of  themselves  constitute  complete  and 
effective  combat  systems.  A combination  of  a thermal  imager,  a laser-designated  missile,  and  a 
radio  command  guidance  link  can  perform  all  of  the  functions  desired  of  a weapon  system,  except 
that  it  won’t  work.  The  pieces  cannot  properly  interact  with  each  other.  Integration  is  the  key  to 
functioning  systems.  Key  to  this  task  is  the  rational  selection  of  subsystems  and  the  intelligent 
placement  of  these  subsystems  on  the  platform  destined  to  carry  them.  Integration  is  a task  that 
must  be  iterative.  The  initial  selection  and  placement  will  invariably  create  integration  problems 
that  must  be  addressed.  For  example,  it  may  be  necessary  to  physically  align  the  reference  axis  of 
a thermal  imager  with  that  of  a laser  designator.  Initial  placement  of  the  thermal  imager  high  on  a 
mast  and  the  laser  designator  with  the  missile  launcher  on  a deck  poses  alignment  problems.  The 
mast  is  a flexible  structure  that  can  move  relative  to  the  deck,  the  relative  motion  makes  alignment 
a difficult  if  not  insurmountable  problem.  It  would  be  better  to  co-locate  the  imager  and  the 
designator.  Selected  systems  may  have  electromagnetic  compatibility  problems.  It  is  common  for 
radars  to  set  off  electronic  warfare  systems  and  for  those  electronic  warfare  systems  to  attempt  to 
jam  the  radars.  Virtually  any  sensor  or  weapon  element  may  cause  compatibility  problems  or  be 
susceptible  to  compatibility  problems  with  other  elements.  Safety  issues  are  rampant  with  sensors, 
weapons,  and  communications  systems.  Radar  emissions  can  kill  and  lasers  can  blind;  heat  and 
blast  from  rocket  motors  are  clear  hazards;  even  communications  antennas  carry  voltages  that  can 
shock  individuals  who  may  contact  them.  The  survivability  of  the  platform  is  one  of  the  primary 
concerns  of  the  combat  systems  engineer.  Survivability  is  usually  considered  to  be  the  sum  of  two 
parts:  susceptibility  (the  ability  of  threat  weapons  to  be  delivered  against  the  platform)  and 
vulnerability  (the  ability  of  threat  weapons  to  damage  or  destroy  the  platform  once  those  weapons 
are  delivered).  Both  selection  and  placement  of  combat  systems  elements  can  affect  susceptibility 
and  vulnerability.  In  addition,  specific  platform  design  functions  that  relate  to  survivability  are 
usually  delegated  to  combat  systems.  For  example,  stealth  (low  observables)  and  armor  are  the 
province  of  the  combat  systems  engineer  not  the  naval  architect  (ship  designer).  Lastly,  it  is 
essential  that  the  entire  system  be  capable  of  being  evaluated  before  being  sent  into  combat. 
Performance  assessment  is  used  to  evaluate  all  of  the  integration  decisions  made  by  the  combat 
systems  engineer.  Any  decision  found  wanting  needs  to  be  revisited  (necessitating  the  iterative 
character  of  integration).  Development  of  effective  integrated  combat  systems  requires  consider- 
ation of  all  of  the  elements  of  this  “Big  Picture”. 

This  series  of  books  is  intended  to  capture  the  “Big  Picture”  as  described  above.  At  the  time 
of  writing  this  foreword  to  the  first  volume,  the  complete  work.  Combat  Systems,  is  expected  to  be 
five  volumes  in  length.  This  approach  was  suggested  by  the  practical  need  to  use  the  text  as  it  was 
being  written.  The  lectures  notes  from  which  the  text  is  being  generated  are  used  for  at  least  eight 
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different  courses  at  NPS  (the  two  TSSE  courses  and  six  other  sensors  and  weapons  courses  in  other 
curricula).  A single  volume  would  have  been  too  massive  (over  2000  pages)  and  would  have 
delayed  availability  of  all  of  the  included  information  for  several  years.  The  five  volumes: 

* Sensor  Functional  Characteristics 

* Sensor  Technologies 

* Conventional  Weapons 

* Unconventional  Weapons 

* Control  Elements  and  Platform  Integration 

are  intended  to  be  used  in  sequence,  but  have  been  made  as  self-contained  as  possible. 

For  example.  Volume  1 discusses  the  nature  of  sensor  observables,  the  effects  of  propaga- 
tion through  the  environment,  the  functions  that  sensors  perform,  and  the  ways  in  which  the 
performance  of  those  functions  are  analyzed.  Volume  1 presents  a minimal  set  of  tools  needed  to 
perform  analyses  and  a theoretical  framework  within  which  any  and  every  kind  of  sensor  can  be 
analyzed  in  a fashion  that  permits  direct  comparisons  and  understanding  through  analogies. 
Appendices  augment  the  material  and  reduce  the  necessity  for  reference  to  other  works  in  order  to 
solve  realistic  problems.  A student  who  thoroughly  understood  the  material  in  Volume  1 would 
know  almost  everything  needed  to  be  able  to  understand  a totally  new  and  generic  sensor  system. 
However,  he  would  know  very  little  about  any  specific  sensor  technology,  such  as  radar  or  sonar. 

Volume  2 discusses  specific  sensor  technologies  applicable  to  military  systems.  It  treats 
every  major  kind  of  radar,  infrared,  electro-optical,  acoustic,  nuclear  sensor  system,  as  well  as  a few 
others.  Each  sensor  is  discussed  in  terms  of  sensor  architecture,  critical  components  and  their 
function,  key  mission  uses,  and  useful  design  relations.  Analogy  is  used  extensively.  Once  a 
technology  or  phenomenon  has  been  discussed  with  respect  to  one  sensor  type,  discussions  of  the 
same  technology  or  phenomenon  in  other  sensor  types  is  done  by  analogy.  The  author  has 
attempted  to  discuss  every  major  sensor  class  in  Volume  2.  Both  realized  and  potential  future 
performance  are  evaluated.  Sensor  designers  with  a good  grasp  of  one  sensor  type  will  often  find 
that  study  of  Volume  2 alone  is  enough  to  gain  a good  understanding  of  other  sensor  types. 
Novices  will  need  to  study  Volume  1 first. 

Volume  3 discusses  conventional  weapons  and  weapons  systems.  It  breaks  the  discussion 
into  two  major  divisions  of  weapons:  electromagnetic  weapons  (information  warfare,  directed 
energy  weapons,  and  electronic  warfare)  and  projectile  weapons  (guns,  missiles,  bombs,  torpedoes, 
etc.).  Common  aspects  of  each  weapon  type  are  pointed  out  as  are  differential  aspects  of  individual 
weapons.  Aspects  of  key  components  such  as  warheads,  guidance,  and  propulsion  are  discussed  in 
separate  chapters.  Volume  3 covers  the  gamut  of  conventional  weapons  useful  for  military 
purposes. 

Volume  4 addresses  unconventional  weapons.  In  this  broad  category  we  have  placed 
weapons  of  mass  destruction  (nuclear,  chemical,  biological,  and  radiological  weapons),  nonlethal 
weapons,  and  potential  weapons  of  the  future  (weather  warfare,  tectonic  weapons,  gravitic  weapons, 
advanced  nuclear  weapons,  nanoweapons,  cyborgs,  clones,  engineering  warriors,  etc.).  Design 
characteristics,  means  of  employment,  and  effects  are  presented  for  each  type  of  unconventional 
weapon. 


Volume  5 discusses  control  elements  (communications  and  processing  systems)  and  the 
overall  integration  of  sensors,  weapons,  and  control  elements  with  the  platforms  to  achieve  usable 
and  complete  combat  systems.  Elements  of  survivability,  observables  reduction,  structural 
hardening,  operations  analysis,  electromagnetic  compatibility,  and  system  safety  are  necessarily 
covered  here.  Volume  5 is  a candidate  for  possible  division  into  two  separate  volumes  in  the  future, 
but  given  the  limited  time  available  to  cover  these  last  topics  in  the  courses  for  which  the  books  are 
being  developed,  this  is  not  likely  to  occur  before  a second  or  later  edition. 

For  the  most  part,  Volumes  3 and  4 can  be  used  and  studied  independent  of  Volumes  1 and 
2.  However,  as  sensors  play  a role  in  many  practical  weapons  systems,  prior  study  of  Volumes  1 
and  2 will  proved  beneficial  in  appreciating  Volumes  3 and  4.  Volume  5 can  be  studied  without 
prior  study  of  Volumes  1-4.  The  principles  are  understandable  and  applicable  in  their  own  right. 
However,  the  presentation  of  these  principles  is  geared  to  match  the  information  and  presentation 
in  the  earlier  volumes 

Each  chapter  of  each  volume  is  augmented  by  references  to  sources  of  specific  results  being 
quoted  or  of  further  information  on  key  topics.  A limited  set  of  problems  is  also  presented.  These 
may  be  used  for  homework.  However,  their  primary  use  is  to  indicate  the  kinds  of  problems  that 
a combat  systems  engineer  may  encounter  and  to  provide  a rationale  for  the  topics  being  covered. 

The  author  welcomes  comments,  corrections,  suggested  improvements,  interesting 
problems,  anecdotes,  examples  of  the  application  of  various  principles,  sample  data  from  real  sensor 
systems  (if  publishable),  and  additional  topics  for  inclusion.  This  work  has  been  evolving  for 
nearly  a decade.  It  should  continue  to  evolve  in  the  future.  The  author  can  be  contacted  via  regular 
mail  at: 


Robert  C.  Harney,  Ph.D. 
Department  of  Systems  Engineering 
Naval  Postgraduate  School 
Monterey  California,  93943 

or  via  electronic  mail  at: 

hamey@nps.edu. 

Robert  C.  Harney 
Febraary  2005 
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PARTI 


SENSOR  FUNCTIONAL  CHARACTERISTICS 
PREFACE 


This  section  addresses  those  attributes  which  are  common  to  broad  classes  of 
sensor  system.  For  example,  the  external  environment  dramatically  affects  the 
performance  of  all  sensor  systems.  Similarly,  the  ability  to  perform  critical  functions 
such  as  detection  or  tracking,  can  be  related  to  certain  universal  sensor  characteristics 
in  an  unambiguous  fashion.  Among  these  common  attributes  are  the  characteristics 
of  observable  signals,  the  effects  of  propagation  through  the  environment,  and  the 
functions  desired  to  be  performed.  Other  than  their  use  as  examples,  specific  sensor 
technologies  are  not  discussed  until  Part  II. 

Specifically,  this  section  describes  a set  of  tools  that  can  be  used  by  the  analyst 
to  predict  and  compare  the  performance  of  any  kinds  of  sensor  systems.  Use  of  these 
tools  permits  “apples  to  apples”  trade  studies  to  be  made  between  sensors  with  little 
or  no  physical  similarity. 

In  addition,  it  will  become  apparent  that  even  after  a detailed  discussion  of  radar 
or  sonar  components,  integration,  and  operation,  the  student  will  be  unable  to  predict 
the  performance  of  any  radar  or  sonar  until  after  he  has  learned  how  to  obtain  and 
incorporate  the  results  of  detection  theory,  estimation  theory,  and  propagation  theory, 
among  others. 

This  material  is  presented  first  in  the  hopes  of  instilling  a recognition  that 
sensors  can  be  studied  as  a unified  topic.  In  this  context,  attempts  to  impart  “special” 
status  to  certain  sensors  such  as  “radar”  or  “sonar”  will  be  seen  to  be  not  only 
superficial  and  but  also  potential  causes  of  less-than-optimal  designs. 
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CHAPTER  1 


SIGNATURES,  OBSERVABLES,  & PROPAGATORS 


Sensors,  Transducers,  and  Detectors 

This  volume  deals  with  sensors  and  their  performance.  According  to  the  dictionary,  a sensor 
is: 

“a  device  that  responds  to  a physical  stimulus  (as  heat,  light,  sound,  pressure,  magnetism, 
or  a particular  motion)  and  transmits  a resulting  impulse  (as  for  measurement  or  operating 
a control). " [1] 

For  purposes  of  this  text,  the  author  prefers  his  own  slightly  more  pragmatic  definition.  A sensor 
is: 

“a  device  that  measures,  observes,  detects,  or  otherwise  characterizes  one  or  more 
attributes  of  an  object,  a collection  of  objects,  or  the  external  environment.  ” 

In  order  to  perform  these  tasks,  a sensor  must  take  information  about  the  target  that  is  expressed  in 
one  form  (emitted  or  scattered  waves,  fields,  or  particles)  and  convert  them  into  another  form 
(currents  in  a wire  or  nerve  impulses  transmitted  to  our  brains).  The  dictionary  defines  a transducer 
as: 

“a  device  that  is  actuated  by  power  from  one  system  and  supplies  power  usually  in  another 
form  to  a second  system  (as  a telephone  receiver  is  activated  by  electric  power  and  supplies 
acoustic  power  to  the  surrounding  air.  ” [1] 

The  author’s  broader  definition  is: 

“a  device  which  converts  information-carrying  signals  from  one  physical form  into  another 
physical  form.  ” 

The  process  of  conversion  is  called  transduction.  An  example  of  a transducer  is  a silicon  photodiode 
which  converts  light  intensity  into  electrical  current.  All  sensor  systems  contain  transducers.  In 
many  sensor  systems  we  refer  to  the  transducer  element  as  a “detector”,  especially  if  it  converts  the 
incident  signal  into  an  electrical  current. 
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Observables 


Physical  objects  possess  a myriad  of  properties:  mass,  volume,  shape,  motion,  composition, 
ability  to  emit  radiation,  ability  to  absorb  or  reflect  radiation,  etc.  Most  of  these  properties  can  be 
detected  and  measured  with  appropriate  sensors.  Any  property  of  an  object  that  is  measured  or 
otherwise  observed  is  called  an  observable  of  that  object.  For  example,  mass  can  be  measured  (in 
principle)  by  a scale.  Mass  is  a potential  observable  of  the  object.  Mass  also  creates  a gravitational 
field,  and  the  resulting  field  can  be  measured  (in  principle)  by  a gravitometer.  Thus  the  gravitational 
field  of  an  object  is  a potential  observable.  If  an  observer  actually  possesses  and  uses  a sensor 
capable  of  measuring  a potential  observable,  it  becomes  a real  observable.  Any  object  possesses 
hundreds  of  potential  observables  (too  many  to  attempt  to  list),  however,  any  practical  sensor  system 
or  suite  of  sensors  will  only  exploit  a few  real  observables. 

If  there  is  a direct  physical  relationship  between  two  observables,  then  measurement  of  one 
can  be  used  to  infer  the  value  of  the  other.  The  relationship  between  mass  and  gravitational  field 
strength  is  well  established,  so  measurement  of  one  allows  us  to  unequivocally  predict  the  value  of 
the  other.  If  this  is  true,  then  we  assume  that  both  quantities  are  observables  of  the  system. 

Observables  can  have  measurable  values  that  range  from  very  small  to  very  large.  Usually, 
the  ability  of  a sensor  system  to  measure  an  observable  depends  on  the  magnitude  of  its  physical 
value.  The  property  of  being  able  to  measure  values  of  an  observable  other  than  zero  is  called 
observability.  Objects  with  limited  observability  are  hard  to  detect  and  measure  and  are  said  to  be 
low  observable.  Stealth  is  the  technology  of  designing  objects  to  possess  low  observability  and/or 
reducing  the  value  of  currently  possessed  observables.  Stealth  and  observables  reduction  will  be 
discussed  in  Volume  3 of  this  series. 


4 


Signatures  and  Propagators 

The  measured  value  of  any  observable  is  called  a signature  of  the  object.  The  dictionary  [2] 
defines  a signature  as  a noun: 

“a  characteristic  trace  or  sign  that  indicates  the  presence  of  a substance  or  the  occurrence 
of  a physical  event.  ” 
and  as  an  adjective: 

“serving  to  identify  or  distinguish  a person  or  group.  ” 

As  in  the  earlier  section,  we  shall  broaden  the  definition  of  the  noun  to: 

“a  characteristic  value  of  an  observable  that  identifies  the  presence  of  an  object,  the 
occurrence  of  an  event,  and/or  aids  in  identifying  or  distinguishing  an  object  from  other 
objects.  ” 

Although  the  dictionary  definitions  imply  uniqueness  to  signatures,  just  as  one’s  signature  is 
supposed  to  identify  you  sufficiently  uniquely  that  when  it  is  placed  on  a check,  your  bank  will  give 
your  money  to  someone  else.  However,  the  mark  “X”  is  still  used  as  a valid  signature  for  illiterate 
persons.  Clearly,  X does  not  convey  much  uniqueness.  Although  the  author  believes  that  the  term 
signature  should  be  used  for  the  measured  value(s)  of  an  observable  or  observables,  especially  those 
which  discriminate  the  object  from  other  objects,  the  terms  observable  and  signature  are  often  used 
interchangeably  with  little  resulting  confusion. 

As  we  shall  see  later  in  this  volume,  all  information  is  useful.  However,  some  information 
may  not  seem  so  at  first  glance.  If  all  we  can  do  is  measure  mass,  and  we  measure  a mass  of  1000 
kg,  this  does  not  tell  us  whether  it  is  a small  car  or  a large  bomb.  However,  it  can  tell  us  that  the 
object  is  not  a man  and  it  is  not  an  armored  vehicle.  This  may  or  may  not  seem  to  be  immediately 
useful.  However,  consider  execution  of  a search  of  a classification  tree  (this  topic  will  be  discussed 
in  more  detail  in  Chapter  12).  Basically,  a classification  tree  is  a diagram  that  breaks  objects  into 
finer  and  finer  classes  giving  progressively  more  information  about  the  object.  A simple  classifica- 
tion tree  is  illustrated  in  Figure  1-1.  Any  information  that  either  directs  a search  onto  a particular 
branch  of  the  tree  or  discourages  search  of  any  branch  or  branches  is  useful  information. 


Figure  1-1.  A classification  tree  for  ground  vehicles. 


GROUND  VEHICLE 


WHEELED  VEHICLE 


OTHER  VEHICLE  TRACKED  VEHICLE 


CAR  MOTOR-  LIGHT  ARMORED  HEAVY 
CYCLE  TRUCK  PERSONNEL  TRUCK 
CARRIER 


SLEDGE  HALF- 
TRACK 


PERSONNEL 

CARRIER 
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People  who  work  with  or  design  sensors  are  familiar  with  a limited  number  of  observables. 
However,  as  we  discussed  in  the  previous  section,  there  is  an  enormous  variety  of  observables  and 
a correspondingly  large  variety  of  sensors.  To  gain  a better  appreciation  of  this  variety  we  have 
attempted  to  classify  all  known  observables  in  several  different  ways.  Such  classifications  serve  not 
only  to  provide  a semblance  of  order  to  a chaotic  situation,  but  also  sets  the  stage  for  possible  future 
identification  of  novel  (at  least  to  most  people)  observables. 

One  distinct  mode  of  classification  is  by  the  propagator  of  the  observable  information.  A 
propagator  is  the  physical  entity  that  carries  the  observable  information.  If  the  observable  is  the 
radar  cross  section  of  a target,  the  propagator  is  electromagnetic  radiation.  If  the  observable  is  the 
sounds  emitted  by  a target,  the  propagator  is  acoustic  radiation.  Based  on  his  analysis  and 
observations,  the  author  has  identified  six  distinctly  different  classes  of  propagator.  These  are  listed 
along  with  examples  of  observables  associated  with  each  class  of  propagator  and  significant 
subclasses  of  propagator  in  Table  1-1. 

We  have  already  identified  electromagnetic  radiation  as  a major  propagator  of  observable 
information.  As  we  shall  see  in  the  next  chapter  there  are  several  major  and  numerous  minor 
subdivisions  of  the  electromagnetic  spectrum  (for  example,  radio  frequency,  microwave,  millimeter- 
wave,  infrared,  visible,  and  ultraviolet).  Individual  sensors  seldom  work  over  more  than  one  of  these 
subdivisions.  Acoustic  radiation  is  another  major  propagator.  Sound  waves  carry  information 
through  the  air  and  through  water.  Seismic  waves  carry  acoustic  information  through  the  ground. 
Acoustic  and  electromagnetic  radiation  are  the  propagators  associated  with  most  of  the  sensors  with 
which  military  personnel  are  familiar.  Radars,  sonars,  thermal  imagers,  television  imagers,  image 
intensifiers,  laser  rangefinders,  etc.,  all  use  either  acoustic  or  electromagnetic  radiation.  Nuclear 
radiation  is  another  significant  propagator  of  information.  The  author  has  included  x-rays  and 
gamma  rays  in  the  class  of  nuclear  radiation  propagators,  even  though  they  are  forms  of  electromag- 
netic radiation.  The  rationale  for  this  involves  the  observation  that  x-rays  and  gamma  rays  tend  to 
be  associated  with  the  same  kinds  of  physical  processes  with  which  neutrons  or  alpha  particles  are 
associated.  Furthermore,  x-ray  and  gamma  ray  detectors  tend  to  have  different  designs  than 
electromagnetic  detectors  operating  at  lower  energies  such  as  the  ultraviolet.  In  this  case,  the  author 
decided  that  practical  aspects  outweighed  theoretical  ones. 

Radiation  is  not  the  only  means  by  which  an  observable  may  be  propagated.  Atoms  and 
molecules  can  be  propagators.  Internal  combustion  engines  emit  distinct  chemical  pollutants  (e.g., 
ethylene  and  carbon  monoxide)  in  their  exhausts.  The  water  vapor  produced  in  jet  engines  may  form 
contrails  (condensation  trails).  One  of  the  major  functions  of  the  Air  Weather  Service  is  to  calculate 
probabilities  of  contrail  formation  to  permit  military  flights  to  be  routed  to  minimize  the  production 
of  unmistakably  observable  contrails.  Human  beings  emit  detectable  odors.  There  were  multiple 
sensor  systems  developed  during  the  Vietnam  War  that  were  dropped  through  the  jungle  canopy 
along  the  Ho  Chi  Minh  Trail  [3].  These  sensors  were  designed  to  detect  the  ammonia  released  from 
urine.  When  the  sensors  detected  the  presence  of  urine  in  an  area,  air  strikes  would  be  called  in  to 
bomb  the  suspected  areas.  In  this  instance  we  probably  killed  more  water  buffalo  than  North 
Vietnamese  soldiers,  but  the  sensors  worked.  Chemical  warfare  agents  often  propagate  their  own 
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Table  1-1.  Classification  schemes  of  observables. 


BY  PROPAGATOR 

ELECTROMAGNETIC 

ACOUSTIC  RADIATION 
NUCLEAR  RADIATION 

ATOMS  & MOLECULES 

ENERGY  & MOMENTUM 

FIELDS 

BY  SOURCE 

EMISSION 

REFLECTION/SCATTERING/ 

ABSORPTION 

OCCULTATION 

INELASTIC  SCATTERING 

RESIDUE 

ANOMALY 


RELATED  OBSERVABLES 

* ULTRAVIOLET,  VISIBLE,  INFRARED,  MILLIMETER- 
WAVE,  RADIATION  MICROWAVE,  OR  RADIO 
FREQUENCY  RADIATION 

* SOUND  WAVES,  SEISMIC  WAVES 

* X-RAYS,  GAMAAA  RAYS,  NEUTRONS,  ALPHA 
PARTICLES,  BETA  PARTICLES 

* ENGINE  EXHAUSTS,  PERSONNEL  ODORS, 
CHEMICAL  AGENTS,  CONTRAILS 

* WAKE  TURBULENCE,  SHOCKWAVES, 
ENVIRONMENTAL  HEATING,  TURF/FOLIAGE 
DAAAAGE 

* MAGNETIC  FIELDS,  ELECTRIC  FIELDS, 

GRAVITY,  ELECTRIC  CONDUCTIVITY, 
AAAGNETIC  PERMEABILITY 


RELATED  OBSERVABLES 

* INTENTIONAL  (ON-BOARD  RADAR,  SONAR, 
RADIO,  OR  LASER  RANGER) 

* INADVERTENT  (EMI,  ACOUSTIC  NOISE, 
THERAAAL  RADIATION,  RADIOACTIVE  DECAY) 

* NATURAL  ILLUMINATION  (VISION,  TELEVISION) 

* ACTIVE  ILLUMINATION  (RADAR,  SONAR,  LASER) 

* SHADOWS,  HOLES  IN  BACKGROUND  NOISE 

* FREQUENCY-SHIFTED  RADIATION 

* INDUCED  EMISSIONS 

* KINEMATIC  PERTURBATIONS 

* MOLECULAR  (EXHAUST,  CONTRAIL,  ODORS) 

* ENERGY  (WAKE  TURBULENCE,  TURF/FOLIAGE 
DAAAAGE  ENVIRONMENTAL  HEATING) 

* FIELD  STRENGTH,  FIELD  DIRECTION,  INTER- 
ACTION STRENGTH 
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observable  signatures  in  the  form  of  their  vapors.  The  atoms  or  molecules  may  be  detected  directly 
(e.g.,  with  a mass  spectrometer  or  a wet  chemical  sensor)  or  they  may  be  detected  indirectly  by  using 
electromagnetic  radiation  or  some  other  propagator.  Such  hybrids  involving  more  than  one 
propagator  often  even  more  possibilities  for  sensor  design. 

Energy  and  momentum  can  act  as  propagators.  Many  people  have  felt  the  shock  wave 
produced  by  a supersonic  aircraft.  This  is  an  example  of  an  energy/momentum  propagator.  Hearing 
the  sonic  boom  is  a hybrid  propagator  as  it  involves  acoustic  waves  as  well.  Swimmers  have  felt 
the  turbulence  in  the  wake  of  a passing  speedboat.  Those  who  see  the  wake  experience  another 
hybrid  propagator.  Environmental  heating  produced  by  the  presence  of  a warm  object  is  another 
energy  propagator  example.  Lastly,  the  expenditure  of  energy  and  momentum  in  a location  can 
cause  physical  alteration  of  structures  in  that  environment.  Simple  examples,  like  the  foliage 
damage  or  turf  displacement  that  is  produced  when  an  animal  moves  through  the  brush  have  been 
exploited  by  hunters  and  trackers  for  millennia.  Finally,  fields  can  be  propagators.  Many  objects 
generate  electric  fields,  magnetic  fields,  or  gravitational  fields.  A magnetohydrodynamic  propulsion 
unit  may  create  a magnetic  field  that  is  measurable  at  considerable  distances.  Other  objects  may 
modify  the  characteristics  of  external  fields.  Different  objects  or  conditions  can  alter  the  interactions 
characteristics  of  fields.  Magnetic  permeability,  or  electric  conductivity  are  examples  of  interaction 
strengths  that  can  carry  information.  For  example,  the  conductivity  of  soil  is  a function  of  its 
moisture  content  and  density.  Both  moisture  content  and  density  are  different  in  an  area  of  disturbed 
soil  (such  as  that  produced  by  digging  a hole  to  bury  a landmine)  compared  to  undisturbed  soil. 
Sensor  technologies  exist  that  are  capable  of  measuring  fields  and  interaction  strengths  of  all  kinds. 

Observables  can  also  be  categorized  by  their  observable  source.  Figure  1-1  also  illustrates 
this  second  categorization.  Emission  is  a prominent  source  of  information.  Emission  may  be 
intentional  such  as  that  produced  by  sensors  (radar,  sonar,  laser  ranger,  radios,  etc.)  on  board  the 
platform  or  it  may  be  inadvertent,  produced  by  natural  processes  (thermal  emission  or  radioactive 
decay)  or  by  artificial  processes  (such  as  electromagnetic  interference  or  acoustic  noise). 
Reflection,  scattering,  and/or  absorption  of  radiation  is  another  prominent  source  of  observable 
characteristics.  The  source  of  the  radiation  may  be  natural  illumination  such  as  sunlight  or 
earthshine  (thermal  emission  from  the  surroundings)  or  it  may  be  produced  by  active  illumination 
by  sensors  such  as  radars,  sonars,  or  laser  sensors.  Television  systems  and  image  intensifier  systems 
may  used  either  natural  or  active  illumination.  Related  to  absorption  but  qualitatively  different  is 
occultation,  the  loss  of  a background  signal  due  to  blockage  of  that  signal  by  nearby  objects.  A 
shadow  is  a form  of  occultation.  So  is  the  loss  of  background  acoustic  noise  when  it  is  blocked  by 
a submarine  coated  with  acoustic  coatings  or  the  loss  of  blue  sky  (or  night  sky)  background  when 
an  air  target  passes  through  the  sensor  line  of  sight.  Lack  of  signal  is  a useful  form  of  signal  in  its 
own  right. 

Inelastic  scattering  is  the  source  of  three  other  kinds  of  observable.  Inelastic  scattering  is 
any  process  in  which  one  object  interacts  with  a second  object  and  loses  (or  gains)  energy  from  the 
interaction.  One  or  both  objects  may  change  their  fundamental  character  in  the  interaction. 
Radiation  may  interact  with  atoms  or  molecules  and  emerge  with  characteristic  shifts  in  frequency. 
For  example,  Raman  scattering  is  a process  in  which  electromagnetic  radiation  with  a fixed  input 
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frequency  scatters  from  molecules  and  emerges  with  different  frequencies  which  have  been  shifted 
by  the  vibrational  and/or  rotational  frequencies  of  the  molecule.  Induced  emissions  are  different 
form  of  inelastic  scattering.  Nuclear  particles  incident  on  certain  materials  can  be  absorbed  with  the 
resultant  emission  of  different  particles.  Neutrons  incident  on  fissionable  materials  will  react 
producing  very  distinctive  emissions  that  can  identify  the  material  as  being  fissionable.  Kinematic 
perturbations  can  be  a source  of  observable  characteristics.  Every  hunter  or  infantryman  is  familiar 
with  the  change  in  motion  of  a target  when  it  is  hit  by  a rifle  bullet.  In  a more  bizarre  application, 
there  have  been  serious  proposals  to  discriminate  reentry  vehicles  from  decoys  by  bombarding  them 
with  sand  (or  possibly  particle  beams).  Reentry  vehicles  are  heavy  and  will  not  have  their  velocity 
changed  significantly  by  a small  change  in  momentum.  Light  decoys  such  as  balloons  will  slow 
down  considerably  when  they  are  hit  with  a small  amount  of  high-velocity  sand. 

Residues  are  another  source  of  observables.  Molecular  residues  are  vapors  and  odors 
produced  by  equipment  or  personnel.  They  also  include  fuel  leakage,  oil  slicks,  contrails,  smoke 
from  engine  exhausts  or  cooking  fires,  etc.  Energy  residues  include  environmental  damage, 
environmental  heating,  wakes,  shock  waves,  turbulence,  etc.  Residues  may  yield  the  knowledge  that 
certain  kinds  of  system  are  (or  were)  present  in  the  vicinity,  without  requiring  direct  observation  of 
those  systems.  The  last  major  source  of  observables  is  anomalies.  An  anomaly  is  an  abnormal 
condition  observed  in  a typically  static  quantity.  Anomalies  may  exist  in  field  strength,  field 
direction,  or  interaction  strength.  For  example,  an  iron  object  will  modify  the  earth’s  magnetic  field 
in  a characteristic  fashion.  At  large  distances,  the  change  in  the  Earth’ s field  (the  magnetic  anomaly) 
will  have  a characteristic  dipole  shape. 

A final  way  of  categorizing  observables  is  on  the  basis  of  their  relative  strength.  Signal- 
dominant  observables  are  characterized  by  a well-defined  signal  corrupted  by  a small  amount  of 
noise  usually  produced  in  the  sensor  itself.  Radars  and  infrared  sensors  looking  at  sky  backgrounds 
are  examples  in  which  the  signal  dominates.  Clutter-dominant  observables  are  characterized  by 
signals  that  do  not  differ  very  much  in  strength  from  the  background.  The  signal  must  be  extracted 
based  on  small  contrasts  against  the  background.  Variations  in  the  background  (clutter)  may  exceed 
the  contrast  of  the  target  against  the  background.  Imaging  sensors,  sonars,  and  radars  looking  at 
terrestrial  backgrounds  are  often  dominated  by  clutter.  Typically,  different  kinds  of  signal 
processing  must  be  employed  when  dealing  with  signal-dominant  observables  than  when  dealing 
with  clutter-dominant  observables.  These  distinctions  will  be  addressed  further  when  we  discuss 
sensor  functions  and  sensor  implementations. 

The  primary  reason  for  this  chapter  is  the  indication  to  the  reader  that  there  is  much  more 
to  military  sensor  systems  than  initially  meets  the  eye.  Weapons  systems  have  seriously  exploited 
only  two  of  the  six  classes  of  propagators  and  only  two  of  the  six  classes  of  observable  sources.  As 
threats  evolve  to  counter  the  capabilities  of  existing  sensor  systems  we  should  be  looking  to  novel 
ways  of  performing  our  mission.  This  will  involve  looking  at  sensors  that  exploit  observables  that 
are  not  being  considered  today.  Today’s  sensors  use  obsolete  technology.  The  sensors  under 
development  today  will  be  obsolete  by  the  time  they  are  fielded.  Technology  will  continue  to  evolve 
at  an  exponential  rate.  As  we  progress  through  this  text,  every  attempt  will  be  made  to  provide  all 
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of  the  conceptual  tools  that  will  permit  the  reader  to  address  non-traditional  or  even  “science  fiction” 
sensor  systems  as  they  are  developed,  to  an  equivalent  degree  of  sophistication  as  more  traditional 
sensors  are  treated  today. 
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Problems 


1-1.  For  each  “propagator”  of  observables  in  Table  1-1  give  at  least  one  concrete  example  of  a 
sensor  using  that  observable.  List  as  many  examples  as  you  can,  but  avoid  duplication  of 
basic  principles. 

1 -2.  For  each  “source”  of  observables  in  Table  1-1  give  at  least  one  concrete  example  of  a sensor 

using  that  source.  List  as  many  examples  as  you  can,  but  avoid  duplication  of  basic 
principles.  Every  example  listed  in  Problem  1-1  should  be  categorized  in  Problem  1-2. 

1 -3 . For  some  military  mission  with  which  you  are  familiar,  hypothesize  a sensor  to  perform  key 

aspects  of  that  mission.  However,  you  must  choose  a sensor  whose  propagator  and  source 
are  not  currently  used  or  proposed  to  perform  that  mission. 
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CHAPTER  2 


PROPAGATION  OF  ELECTROMAGNETIC  RADIATION.  I. 
FUNDAMENTAL  EFFECTS 


What  is  Electromagnetic  Radiation? 

Electromagnetic  radiation  consists  of  simultaneously  oscillating  electric  and  magnetic  fields 
that  can  move  from  one  point  in  space  to  another  preserving  certain  fundamental  characteristics  of 
the  oscillations.  Of  primary  interest  is  the  fact  that  these  oscillating  fields  can  transport  energy  from 
one  point  in  space  to  another.  No  physical  medium  is  required. 

An  old  joke  among  physicists  is  based  on  Genesis  1:1  through  1:3  of  the  Bible  [1].  In  the 
language  of  the  “King  James  version”  it  goes: 

“In  the  beginning,  God  created  the  heaven  and  the  earth.  And  the  earth 

was  without  form,  and  void;  and  darkness  was  upon  the  face  of  the  deep. 

And  the  Spirit  of  God  moved  upon  the  face  of  the  waters. 

And  God  said: 


V D = Ajtp, 

(2.1) 

VB  = 0, 

(2.2) 

vxi  + ^ = 0, 

c dt 

(2.3) 

c dt  c 

(2.4) 

And  there  was  light! 

The  humor  comes  from  the  fact  that  the  existence  of  electromagnetic  radiation  (i.e.,  light)  is  a direct 
and  inevitable  consequence  of  Maxwell’s  Equations  - equations  (2.1)  through  (2.4)  above.  Had 
“God”  uttered  Maxwell’s  equations,  instead  of  the  Scriptural  “Let  there  be  light  T,  there  would  have 
been  light  without  further  ado.  Equations  (2.1)-(2.4)  plus  the  constitutive  relations: 

D = E + AtlP  = E ~ Xe)^  ~ (2-^) 
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and 


(2.6) 


B^H  + AkM  = (l  + 4;r^)#  = /Ji 

can  together  be  easily  solved  to  yield  wave  equations. 

Before  performing  this  mathematical  piece  de  resistance  the  multiplicity  of  terms  used  above 
needs  to  be  defined.  Specifically: 

E - Electric  Field 
D - Electric  Displacement 
P = Electric  Polarization 
- Electric  Susceptibility 
s = Dielectric  Constant 
B - Magnetic  Induction 
H - Magnetic  Field 
M = Magnetization 
X^  = Magnetic  Susceptibility 
/a  = Magnetic  Permeability 
c = Speed  of  Light  in  Vacuum 
p - Charge  Density 
J = Current  Density 

The  underlined  quantities  are  complex  constants.  Now  assume  that  the  region  of  space  of  interest 
is  empty  (i.e.,  charge  density  and  current  density  are  zero).  In  empty  space,  Maxwell’s  equations 
may  be  written  as 


y-E  = 0, 

(2.1a) 

VB  = 0, 

(2.2a) 

VxE  + ^ = 0, 

c dt 

(2.3a) 

ps  dE 

V X 5 = 0 . 

c dt 

(2.4a) 
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If  we  take  the  curl  (vx)  of  equations  (2.3a)  and  (2.4a)  we  obtain 


and 


^ ^ ^ 1 aV  X 5 ^ 

VxV  xE  + = 0 

c dt 


IJ.S  av  X E 

VxVxB  - ^ = 0 


c dt 

If  we  now  use  an  identity  relationship  from  multidimensional  calculus 

Vx  Vxf  = V(v-f)  - V"f 


(2.7) 


(2.8) 


we  obtain 


and 


V(V-^)  - V^E  + = 0 

^ ’ c dt 

v(V-5)  - V^B  - ^ 0 

^ ^ c dt 


(2.9) 

(2.10) 
(2.11) 


Upon  substitution  of  the  original  curl  and  divergence  equations  (2.1a)-(2.4a)  back  into  Eq.  (2.10) 
and  (2.1 1)  we  obtain  the  wave  equation  solutions 


dt^ 

(2.12) 

and 

=0. 

c'  de 

(2.13) 

Equations  (2.12)  and  (2.13)  admit  solutions  of  the  form 

(2.14) 

and 

5(r,0  = 

^ ia>t  + inlkQ-r) 

B^e 

(2.15) 

where 

n = .JJis 

; = for  non  - magnetic  media 

(2.16) 

and 

0)  = ck^ 

= cAtq  = Ittc/  a . 

(2.17) 
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These  solutions  oscillate  sinusoidally  in  both  space  and  time  with  temporal  frequency  co/27i  and  free- 
space  wavelength  X,  as  illustrated  in  Figure  2-1. 


Figure  2-1.  Graphical  representation  of  electromagnetic  radiation  as  simultaneous 
in-phase  oscillations  of  electric  and  magnetic  fields  oriented 
at  right  angles  to  each  other  and  the  direction  of  propagation. 


PROPAGATION  VELOCITY  - c/n 


Equations  (2.14)  and  (2.15)  are  plane  waves  in  that  the  amplitude  of  the  oscillations  varies 
only  sinusoidally  along  the  direction  of  propagation  and  there  is  no  variation  in  amplitude 
perpendicular  to  the  direction  of  propagation.  The  wave  equations  admit  another  kind  of  solution, 
that  is,  spherical  waves.  The  spherical  wave  may  be  represented  as 

^icot  + ikr 

E(r,t)  = E„ (2.18) 

r 


^icot  + ikr 

B{r,t)  = B, (2.19) 

r 

Spherical  waves  either  expand  radially  outward  from  a point  or  collapse  radially  inward  on  a point. 
The  field  strength  falls  off  inversely  with  the  radial  distance  r from  the  point  of  expansion  or 
collapse.  Both  plane  and  spherical  waves  find  use  in  describing  phenomena  associated  with 
electromagnetic  radiation. 

Before  continuing  with  discussions  of  these  phenomena,  it  is  instructive  to  discuss  some  of 
the  nomenclature  associated  electromagnetic  radiation  of  different  frequencies.  Figure  2-2  shows 
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Figure  2-2.  The  electromagnetic  spectrum  from  radio  frequencies  to  the  ultraviolet. 
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the  electromagnetic  spectrum  from  radio  frequencies  to  the  ultraviolet.  In  truth,  the  electromagnetic 
spectrum  extends  well  beyond  the  ultraviolet  into  the  x-ray  and  gamma-ray  regions.  We  have  not 
included  the  very  high  frequency  x-ray  and  gamma-ray  regions  because  these  are  most  commonly 
encountered  as  by-products  of  nuclear  reactions.  We  will  discuss  these  and  other  nuclear  radiations 
in  a separate  chapter.  Electromagnetic  radiation  with  frequencies  below  30  Hz  is  detected  by 
sensors  which  are  essentially  capable  of  detecting  static  fields.  We  will  consider  them  separately 
when  we  discuss  “static”  fields  and  anomalies. 

Within  the  remaining  electromagnetic  spectrum,  many  different  groups  of  technologists  have 
applied  their  own  terminologies  to  different  regions  of  the  spectrum.  In  most  cases,  the  nomencla- 
ture was  merely  descriptive,  although  in  a few  it  was  intentionally  confusing.  In  the  early  days  of 
radio  research,  transmissions  tended  to  occur  in  the  band  from  30  kHz  to  30  MHz  without  much 
discrimination.  As  people  noticed  differences  in  propagation  characteristics,  they  started  talking 
about  low  frequency,  medium  frequency,  and  high  frequency  radio  waves,  each  covering  roughly 
a decade  of  frequency.  With  the  advent  of  radar  systems,  a great  push  to  higher  and  higher 
frequencies  began.  Radars  first  moved  into  the  very  high  frequency  (VHP)  band  and  later  into  the 
ultrahigh  frequency  (UHF)  band.  Again  it  was  convenient  to  maintain  roughly  a decade  between 
band  centers.  The  development  of  the  magnetron  permitted  radars  to  push  beyond  UHF  frequencies. 
Some  people  referred  to  the  very,  very  high  frequency  radiation  as  microwaves.  Others  continued 
using  stronger  comparatives  such  as  superhigh  frequency  (SHF)  and  ultimately  extremely  high 
frequency  (EHF).  Fortunately,  at  this  point  people  working  on  ever  higher  frequency  radio  waves 
started  to  transgress  into  the  realm  of  those  people  working  on  ever  longer  wavelength  infrared 
radiation  and  ultimately  dropped  the  race  for  stronger  comparatives.  Otherwise  we  might  be 
describing  the  near  infrared  as  very-super-ultra-extremely-high  frequency  (VSUEHF)  radio  waves 
or  some  other  ridiculous  term.  Interest  in  lower  frequency  radio  waves  led  to  the  use  of  the  terms 
very  low  frequency  (VLF)  and  extremely  low  (ELF)  radiation,  to  be  consistent  with  existing  radio 
frequency  nomenclature. 

Given  that  this  terminology  conveys  very  little  information,  other  researchers  started 
describing  the  frequency  regions  using  an  explicit  wavelength  terminology.  Waves  with  wave- 
lengths centered  around  one  meter  (i.e,  0.3  m to  3 m)  were  called  metric  waves.  The  next  longer 
wavelength  region  was  called  dekametric  waves;  the  next  short  wavelength  region  was  called 
decimetric  waves,  and  so  on  using  the  SI  (International  System  of  Units)  prefixes.  The  EHF  region 
is  most  commonly  referred  to  as  the  millimetric-wave  or  millimeter- wave  region  of  the  spectrum. 
Researchers  in  the  former  Soviet  Union  adopted  a terminology  based  on  the  exponent  of  the 
frequency  at  the  center  of  each  decade-wide  band.  Thus,  Band  6 had  a center  frequency  of  10®  Hz 
(=  1 MHz). 

During  World  War  II,  radar  was  a highly  classified  field.  To  make  it  easier  to  conduct  long- 
distance development  programs  while  retaining  a degree  of  secrecy,  the  region  of  the  spectrum  in 
which  radars  operated  was  broken  into  a large  number  of  narrow  bands  (typically  no  wider  than  an 
octave)  and  assigned  a random  letter  (L,  S,  C,  X,  K,  etc.).  Researchers  could  describe  a system  as 
operating  at  C-band  (i.e,  at  4-8  GHz)  and  maintain  a significant  level  of  security.  Only  those  with 
clearances  knew  the  band  codes.  After  the  war,  when  radar  proliferated  beyond  military  applica- 
tions, this  terminology  was  retained.  This  situation  lasted  until  the  1980's  when  the  Joint  Chiefs  of 
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Staff  promulgated  a directive  that  simplified  the  alphabet  soup  into  a simple  A,  B,  C,  D,  etc.  code 
sequence  as  shown  in  Figure  2-2. 

The  very  high  frequency  end  of  the  spectrum  is  subdivided  into  two  partially  overlapping 
regions:  the  infrared  and  the  optical  regions.  The  optical  region  spans  from  the  ultraviolet  around 
0.1  pm  to  the  near  infrared  around  1.2  pm.  The  cutoff  at  1.2  pm  is  somewhat  arbitrary.  Photons 
(quanta  of  electromagnetic  radiation)  at  wavelengths  longer  than  about  1 .2  pm  cannot  cause  external 
photoelectrons  in  material.  Thus  photomultipliers  and  image  intensifiers  can  only  be  implemented 
at  wavelengths  shorter  than  about  1 .2  pm.  The  author  has  used  this  photoelectric  cutoff  to  define 
the  edge  of  the  optical  region  of  the  spectrum.  The  optical  spectrum  is  further  subdivided  into  the 
visible  spectrum  (0.4  pm  to  0.76  pm  - the  range  of  wavelengths  most  human  beings  can  see),  the 
ultraviolet  (0.4  pm  to  0.185  pm),  and  the  vacuum  ultraviolet  (0.185  pm  to  0.1  pm  - all  known 
materials  absorb  ultraviolet  radiation  below  0.185  pm).  The  infrared  region  can  be  subdivided  in 
several  ways.  People  working  with  thermal  imaging  systems  tend  to  ignore  infrared  radiation  with 
wavelengths  longer  than  14  pm  and  shorter  than  3 pm.  They  talk  about  long-wave  infrared  (roughly 
7.5  pm  to  14  pm)  and  mid-wave  infrared  (roughly  3 pm  to  5.5  pm).  Other  researchers  have  divided 
the  infrared  spectrum  into  far  infrared  (1000  pm  to  20  pm),  mid- infrared  (20  pm  to  1 .5  pm),  and  the 
near  infrared  (1.5  pm  to  0.76  pm).  This  last  set  of  divisions  is  not  universally  accepted.  One  has 
a high  probability  of  finding  definitions  that  differ  from  any  of  the  above.. 

In  Figure  2-2  we  have  also  listed  the  regions  in  which  commercial  radio  and  television 
systems  operate,  as  well  as  ranges  commonly  ascribed  to  audio  and  video  technologies.  The  primary 
purpose  of  Figure  2-2  is  to  serve  as  a reference  that  may  serve  to  make  it  easier  for  the  system 
engineer  to  collect  and  compare  data  from  different  sources  and  communities.  Having  a common 
set  of  definitions  or  at  least  a “Rosetta  stone”  (such  as  Figure  2-2)  is  essential  to  cross-cultural 
communication. 
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Attenuation  of  Electromagnetic  Radiation  in  Bulk  Materials 

The  solutions  of  the  wave  equations  depend  (among  other  things)  on  the  dielectric  constant 
of  the  medium  in  which  the  waves  are  propagating.  In  general,  the  dielectric  constants  of  real 
materials  are  complex  functions.  They  are  also  generally  tensor  quantities  but  such  detail  is  the 
province  of  specialists  and  usually  not  of  importance  to  systems  engineers.  If  the  dielectric  function 
is  complex,  then  the  refractive  index  must  also  be  complex.  Let  us  define  the  complex  dielectric 
function  as  having  real  {S;)  and  imaginary  (£2)  parts,  as  shown  below. 

£=  - 1^2  (2.20) 

Then  the  refractive  index  must  be  complex 

n - ■yfs  - n - iK  (2.21) 


with  real  part  (n)  and  imaginary  part  (/c).  The  components  of  the  refractive  index  are  related  to  the 
real  and  imaginary  components  of  the  dielectric  function  by  the  relations 
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£^-n  -K 

S2  =2nfc 
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1/2 
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(2.22) 

(2.23) 


(2.24) 


(2.25) 


At  this  point  the  many  readers  will  be  asking  themselves  what  does  it  mean  for  a material 
to  have  a complex  refractive  index.  As  we  shall  soon  see,  the  impact  of  a complex  refractive  index 
is  that  the  material  absorbs  electromagnetic  energy.  The  intensity  (power  flowing  through  a unit 
area)  of  electromagnetic  radiation  is  proportional  to  the  square  of  the  electric  field 


I{r)  oc  \E{r) 
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where 


(2.27) 


a = iKkf^  = 4kk  / X 

is  the  absorption  coefficient.  There  is  an  enormous  body  of  technical  literature  which  has  tabulated 
the  real  and  imaginary  parts  of  the  refractive  index,  the  real  and  imaginary  parts  of  the  dielectric 
coefficient,  or  the  absorption  coefficients  of  many  materials  at  a number  of  wavelengths  of  interest. 
Use  of  the  relations  above  permits  conversion  of  data  obtained  in  one  form  to  other  potentially  more 
useful  forms.  As  a concrete  example,  we  reproduce  the  complex  refractive  index  data  for  water  in 
Figure  2-3  and  compare  this  with  the  absorption  coefficient  calculated  using  the  same  data  in  Figure 
2-4. 


In  the  microwave  region  of  the  spectrum,  some  technologists  have  adopted  a different 
nomenclature  for  absorption.  A quantity  called  the  loss  tangent  {tan  S)  is  defined  such  that 

tan  S = s^l  s^.  (2.28) 

Many  references  give  tabulated  data  on  loss  tangent  and  real  dielectric  coefficient  versus  frequency 
for  a number  of  materials.  In  terms  of  the  loss  tangent  the  other  dielectric  quantities  can  be 
determined  from  the  relations 


and 


£-  £^{\-i tan<J) 
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— ^(l  + tan^ 


£ 
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(2.29) 

(2.30) 


In  the  limit  that  the  loss  tangent  is  small  compared  to  unity,  Eq.  (2.30)  may  be  approximated  by 
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— tan^  S 


1/2 
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tan^ 


with  inverse 


tan^  = Aa  / Im  . 


(2.31) 

(2.32) 


There  are  also  occasions  in  which  it  is  necessary  to  calculate  the  attenuation  of  electromag- 
netic radiation  in  conductive  media.  If  cris  the  conductivity  (equal  to  1 /resistivity)  of  a medium  then 
the  attenuation  coefficient  is 


a 


= yJljUCOCT  « dWlO  ’ V(7  (for  non  - magnetic  materials) . (2.33) 


21 


R 

E 

A 

L 

P 

A 

R 

T 

0 
F 

R 

E 

F 

R 

A 

C 

T 

1 

V 

c 


N 

D 

E 

X 


1 np-m 


1 np+nn 


Reflection  and  Refraction 


When  an  electromagnetic  wave  is  incident  on  an 
interface  between  two  propagation  media  with  different 
characteristics,  several  things  happen.  Some  of  the  radia- 
tion is  reflected  by  the  interface,  while  some  of  the  radia- 
tion is  transmitted.  However,  the  propagation  direction  of 
the  transmitted  radiation  is  altered  in  a process  known  as 
refraction.  Consider  the  situation  outlined  in  Figure  2-5. 

The  incident  ray  with  amplitude  strikes  the 
interface  with  an  angle  ^ relative  to  the  normal  to  the 
surface.  The  reflected  ray  with  amplitude  leaves  the 
interface  at  an  angle  ^ relative  to  the  normal  and  lies  in  the 
plane  formed  by  the  incident  ray  and  the  surface  normal  on 
the  opposite  side  of  the  surface  normal  from  the  incident 
ray.  The  transmitted  ray  with  amplitude  Ej  leaves  the 
interface  at  an  angle  6,  relative  to  the  surface  normal  and 
lies  in  the  plane  formed  by  the  incident  ray  and  the  surface  normal. 


Figure  2-5.  Geometry  of  reflection 
and  refraction  at  an  interface. 


The  angle  of  the  transmitted  ray  is  related  to  the  angle  of  the  incident  ray  by  Snell’s  Law  of 
refraction 


sin  6^  - («j  / «2 ) sin  6^ 


(2.34) 


where  Uj  and  are  the  refractive  indices  of  the  two  media  as  shown  in  the  Figure.  Snell’s  law  of 
refraction  coupled  with  the  law  of  reflection  (angle  of  reflection  equals  angle  of  incidence)  form  the 
basis  of  almost  all  optical  design. 

The  reflected  component  is  of  significance  for  a number  of  reasons  that  will  be  discussed 
later  in  this  section.  Fresnel  developed  expressions  that  relate  the  reflected  component  amplitude 
to  the  incident  amplitude.  For  this  reason  we  commonly  refer  to  reflection  at  an  interface  between 
dielectric  media  as  Fresnel  reflection  [2].  Specifically,  Fresnel  obtained 


r nl/2 

cos  6',  - nl-  nf  sin^  6^ 

r 2 2 


(2.35) 


for  the  component  of  incident  radiation  polarized  perpendicular  to  the  plane  formed  by  the  incident 
ray  and  the  surface  normal  (that  is,  the  component  of  the  electric  field  vector  that  is  perpendicular 
to  this  plane),  and 
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(2.36) 


«2  cos  6^  - sin 

r nl/2 

n\  CQ)%6^  + «2  “ sin^  0^ 


for  the  component  of  incident  radiation  polarized  parallel  to  the  plane  formed  by  the  incident  ray  and 
the  surface  normal.  Because  these  two  expressions  are  different,  reflection  from  a dielectric  surface 
may  alter  the  polarization  of  the  radiation.  This  is  a major  design  concern  in  systems  that  attempt 
to  measure  the  polarization  of  the  radiation  as  an  integral  part  of  their  function. 

Equations  (2.35)  and  (2.36)  may  be  used  to  calculate  the  strength  and  degree  of  polarization 
alteration  of  reflected  radiation  at  any  angle  of  incidence.  However,  the  equations  are  complicated 
and  are  usually  not  used  for  back-of-the-envelope  analyses.  However,  at  normal  incidence,  the 
expressions  simplify  considerably  to 

Er  ^ +(^1  ~ ^2 

^0  llo,,  «1  + «2 


(2.37) 


Since  the  intensity  (or  power)  in  an  electromagnetic  field  is  proportional  to  the  square  of  the  field 
strength,  Eq.  (2.37)  may  be  squared  to  yield 


Pr 

- ri2 

Po 

normal 

(2.38) 


If  one  of  the  materials  is  air  («/  = 1),  then  we  get  an  even  simpler  expression 


Pr 

r«2  - 1^ 

Po 

normal 

v.^2  + ly 

(2.39) 


Equation  (2.39)  is  plotted  for  various  values  refractive  index  (and  dielectric  constant)  in  Figure  2-6. 
Glass  has  a refractive  index  of  roughly  1 .5  relative  to  air.  From  the  figure  we  find  that  the  Fresnel 
reflectance  is  0.04  per  surface.  A thin  pane  of  glass  will  reflect  8%  (4%  per  surface)  of  incident 
light.  In  another  example,  germanium  has  a refractive  index  4.0  in  the  infrared.  The  infrared 
Fresnel  reflectance  of  germanium  is  0.36.  Thus,  each  surface  of  a plate  of  germanium  will  reflect 
36%  of  the  incident  infrared  radiation.  The  reflectance  is  so  high  that  multiple  internal  reflections 
must  be  considered  when  determining  the  total  reflectance  which  corresponds  to  about  53%. 

In  general,  the  Fresnel  reflectance  for  perpendicular  polarized  radiation  will  increase 
monotonically  with  increasing  angle  of  incidence.  However,  for  parallel  polarized  radiation,  a quite 
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Figure  2-6.  Fresnel  reflectance  of  a surface  as  a function  of  relative  refractive 

index  or  dielectric  coefficient. 
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different  behavior  is  observed.  The  reflected  component  will  decrease  to  zero  as  the  angle  of 
incidence  approaches  an  angle  called  Brewster’s  angle.  Brewster’s  angle  is  defined  by 


6 ^ = tan  ' («2  / ) • 


(2.40) 


At  Brewster’s  angle  the  reflectance  of  the  perpendicular  polarization  is  given  by 


r 2 2 2 

Wj  cos  6^-  n^-  sin  6^ 

r 2 2 2 

WjCOs6>g  + n^-n^  sin  6j^ 


0. 


(2.41) 


Thus,  unpolarized  light  that  is  reflected  off  of  a surface  at  Brewster’s  angle  will  be  transformed  into 
light  which  is  perfectly  polarized  in  the  perpendicular  polarization.  The  different  behavior  of  the 
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two  polarizations  is  shown  graphically  in  Figure  2-7.  Note  that  the  average  reflectivity  for 
unpolarized  light  (light  containing  equal  amounts  of  parallel  and  perpendicular  polarization 
increases  with  increasing  angle  of  incidence.  However,  the  increase  is  very  small  until  the  incident 
angle  exceeds  40°. 


Figure  2-7.  Angular  dependence  of  Fresnel  reflection. 


ANGLE  OF  INCIDENCE  (deg) 


For  completeness,  Figure  2-8  shows  the  calculated  reflectivity  at  normal  incidence  of  liquid 
water  using  the  data  from  Figure  2-3.  This  was  calculated  using  the  expression 
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(2.42) 


which  can  be  obtained  directly  from  Eq.  (2.39)  by  expressing  explicitly  as  the  sum  of  its  real  (n) 
and  imaginary  (x)  parts.  The  reader  may  question  why  such  a large  quantity  of  supplemental 
material  is  being  included.  Part  of  the  answer  lies  in  attempting  to  bring  esoteric  mathematical 
results  to  a more  concrete  form.  Another  part  of  the  answer  is  that  the  information  may  be  useful 
to  the  reader  in  the  future  and  is  often  hard  to  locate.  To  a significant  extent,  the  author  hopes  that 
this  work  will  not  only  serve  a textbook  for  combat  systems  engineering,  but  also  as  a handbook  to 
assist  the  engineer  in  what  the  author  acknowledges  as  an  extremely  difficult  task. 
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Interference 


Two  electromagnetic  waves  can  interfere  with 
each  other.  The  intensity  at  any  point  in  space  is  equal 
to  the  absolute  value  squared  of  the  local  electric  field. 
The  local  field  is  the  vector  sum  of  the  electric  fields 
originating  from  all  contributing  sources.  If  the  frequen- 
cies of  the  two  wave  differ,  then  the  interference  pattern 
comes  and  goes  at  a frequency  equal  to  the  difference  in 
frequency  between  the  two  waves.  The  effect  is  not 
often  directly  observed  because  human  beings  typically 
have  sensor  frequency  responses  which  are  too  low  to 
see  the  difference  signal.  Nevertheless,  this  is  exploited 
in  heterodyne  receivers.  Heterodyne  reception  will  be 
discussed  in  more  detail  in  Chapter  14  on  coherent  laser 
radars.  However,  if  the  two  waves  have  the  same  fre- 
quency, the  interference  pattern  is  stationary  and  easily 
observed. 


Figure  2-9.  Interference  between 

waves  from  two  point  sources. 


Consider  spherical  waves  emanating  from  two 

point  sources  separated  by  a distance  a (as  illustrated  in  Figure  2-9.  Let  us  consider  the  field  at  a 
point  P at  some  considerable  distance  from  both  sources,  such  that  its  location  is  a distance  z along 
the  bisector  between  the  two  sources  and  distance  x perpendicular  to  the  bisector.  The  distance  z 
is  assumed  to  be  much  larger  than  the  wavelength  (this  will  let  us  ignore  the  differences  between 
r,  and  in  the  1/r  dependence  of  the  amplitude  terms  but  not  in  the  phase  (cosine)  terms).  The  local 
field  at  P is 


(2.43) 

^ {EJ +{EJ 

and  obviously  possesses  two  components.  The  intensity  at  point  P is  the  square  of  this  field  and  is 
seen  to  be 
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or 


/(at  P)  « ^ + El  + lE^E^  cos{k{r^  - r2))j 


where  we  have  use  the  complex  function  identity  relations 


- cosj^  ± i sin_y 


and 


e = (cosjF  + i sinj^)  + (cosjf  - i sin 7)  = 2cosjf 


If  Ej  and  E2  are  roughly  equal  in  strength,  then 


/(at  P)  oc  4El  cos"  (/(r,  - ) / 2) 


where  we  have  used  the  trigonometric  identity 


cos2>^  = 2 cos"  y -1. 


If  z is  much  greater  than  both  the  transverse  distance  x and  the  source  spacing  a then 
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Therefore, 


/(at  P)  oc  4Ef 
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(2.52) 


with  maxima  at  x = mXz/a  where  m is  an  integer.  We  see  that  two-wave  interference  produces  a 
sinusoidal  spatial  intensity  pattern  with  peaks  that  are  4 times  the  average  value  of  the  intensity  of 
a single  source  and  valleys  that  drop  to  zero  intensity.  Interference  between  waves  from  more  than 
three  sources  can  produce  complex  patterns  with  the  highest  peaks  being  even  larger  than  those  from 
two  sources. 
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Huygens’  Principle  and  Diffraction 

At  any  point  in  space,  the  electric  field  of  an  electromagnetic  wave  can  be  defined  in  terms 
of  its  vector  field  strength  E , angular  frequency  <u,  propagation  direction  k , and  a phase  ^ 


E 


k-r  - cot  + ifi^ 


(2.53) 


A wavefront  of  a monochromatic  (single  frequency  co)  electromagnetic  wave  is  any  surface  on  which 
the  phase  is  constant  and  equal. 

Huygens  postulated  that  the  propagation  of  electromagnetic  waves  could  be  analyzed  by 
assuming  that  each  point  on  a wavefront  acts  as  a source  of  a spherical  “wavelets”  (i.e.  a tiny 
spherical  wave).  The  coherent  summation  (and  interference)  of  the  wavelets  at  some  distance  from 
the  original  wavefront  generates  a new  wavefront.  This  wavelet  generation  followed  by  interference 
followed  by  wavelet  generation  can  be  continued  to  determine  the  wavefront  at  any  distance  given 
the  wavefront  at  some  plane.  This  is  the  essence  of  Huygens’  principle. 

Consider  Figure  2-10.  The  parallel  lines  repre-  Figure  2-10.  Addition  of  Huygens’ 
sent  wavefronts  of  a plane  wave  separated  by  one  wave-  wavelets  to  reconstruct  the  wavefront 
length.  The  dots  represent  some  of  an  infinite  number 
of  a plane  wave. 

of  points  on  one  of  the  wavefronts  used  to  generate 
Huygens’  wavelets.  The  wavefront  of  each  wavelet  is 
shown  as  it  expands  away  from  the  source  point.  Each 
of  these  wavefronts  is  assumed  to  have  the  same  phase 
as  the  others.  Note  further  that  the  wavelets  appear  to 
add  in  phase  along  a line  which  coincides  with  one  of 
the  original  plane  wavefronts.  If  we  look  at  the  first 
point  on  either  side  of  the  center  point,  the  angle  away 
from  the  plane  wave  propagation  direction  is  symmet- 
ric. Any  tendency  for  one  point  to  cause  a change  in 
direction  is  exactly  counteracted  by  the  opposite  ten- 
dency resulting  from  the  symmetric  point.  Since  the 
line  of  points  is  infinite,  for  any  point  that  would  cause 
deviation  from  the  plane  wave  character,  there  is  a 
symmetric  point  that  prevents  that  deviation. 

Mathematically,  Huygens’  principle  can  be 
represented  as  an  integral  over  the  input  plane  of  a 
large  number  of  spherical  waves  generated  by  the 
points  of  that  plane.  Specifically,  we  will  represent  an 
arbitrary  electromagnetic  wave  as  the  real  part  of  a complex  function 
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(2.54) 


For  this  input  function,  Huygens’  integral  is 


#('■)  = 7/1 


l + coscf  e 


ik\r-ro\ 


r-ro 


Output  Surface  Input  Obliquity  Spherical 

Field  Integral  Field  Factor  Wavelet 


(2.55) 


where  the  individual  terms  have  been  specifically  identified  with  a (in  the  obliquity  factor)  being 
the  angle  between  the  vector  r —r^  and  the  z-axis  and  k being  the  wavenumber 


k = 2k  ! X = 2kco  / c . 


(2.56) 


The  Huygens’  integral  correctly  predicts  many  observed  propagation  phenomena.  However,  it  is 
difficult  to  evaluate.  It  is  common  to  use  an  approximation  called  the  Fresnel  approximation. 
Specifically  we  define  L=z-Zq  and  we  assume  cos  a~  1.  In  this  approximation. 


r-ro  = 


(x-x^f  +(y-y,f  +(z-z,f 


(2.57) 


Substituting  the  Fresnel  approximation  into  Huygens’  integral  leads  to  the  Fresnel  diffraction 
equation 


^(x,  y,  Zq  + L) 


JJt/Xotfyo  {^(Xo,Vo,Zo) 

nput 

)lane 

xexp  -^[(x-x,f  +{y-y,f^ 


(2.58) 


This  equation  can  be  explicitly  calculated  to  determine  how  the  field  at  any  one  plane  will  transform 
when  it  propagates  to  a distant  plane. 
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In  general,  diffraction  calculations  are  the  Figure  2-11.  Huygens’  wavelets  as  an 
province  of  sensor  design  specialists.  However,  explanation  of  the  origin  of  diffraction, 
knowing  that  Huygens’  principle  leads  to  a mathe- 
matically correct  diffraction  integral  makes  it  easier 
for  us  to  visualize  why  diffraction  occurs.  Consider 
Figure  2-11.  In  this  figure  an  aperture  blocks  all  of 
the  wavefront  except  for  a small  amount.  Only 
inside  the  aperture  is  there  any  field  to  produce 
Huygens’  wavelets.  Wavelets  produced  near  the 
center  of  the  aperture  behave  much  like  there  was 
no  aperture  present.  They  see  a roughly  symmetric 
situation  with  wavelets  on  one  side  of  center  seeing 
corresponding  wavelets  on  the  opposite  side  to 
cancel  out  their  spherical  expansion  characteristics. 

Radiation  corresponding  to  those  wavelets  contin- 
ues to  propagate  in  the  general  direction  of  the 
original  wavefront.  However,  wavelets  produced  at 
the  edge  of  the  aperture  see  an  asymmetric  situa- 
tion. Clearly,  the  wavelets  which  would  cancel  the 
spherical  character  are  missing.  Lacking  the  sym- 
metric wavelets,  radiation  near  the  edges  of  the  aperture  will  maintain  its  spherical  character, 
allowing  the  wave  to  bend  around  the  comer  of  the  aperture.  Those  with  “tme  faith”  can  even  draw 
upon  the  fact  that  interference  is  the  source  of  the  success  of  Huygens’  principle,  to  argue  that  there 
should  be  some  sort  of  interference  pattern  superimposed  on  the  general  spreading.  Indeed,  there 
are  interference  effects,  although  they  are  not  the  simple  two-source  pattern  calculated  earlier. 


A few  cases  of  diffraction  can  be  evaluated  explicitly.  One  of  these  is  diffraction  by  a 
uniformly  illuminated  circular  aperture.  Let  a represent  the  radius  of  the  aperture.  The  result  of 
calculations  which  will  not  be  repeated  here,  is  that  the  field  pattern  at  a considerable  distance  from 
the  aperture  is  given  by  the  Airy  function  [2].  That  is. 


I(R,0’) 


Pna^ 

’2Ji(;r(Z)/;i)sin<9') 

! A)sin^') 

(2.59) 


where  Ji(x)  is  the  Bessel  function  of  the  first  kind  of  order  1,  D=2a  is  the  diameter  of  the  aperture, 
T is  the  wavelength  of  the  illuminating  radiation,  and  P is  the  total  power  incident  on  the  aperture. 
The  squared  quantity  in  the  rectangular  brackets  is  normalized  to  unity  at  zero  angle  and  also 
normalized  to  unity  total  area  (the  integral  over  all  angles  is  unity).  If  we  assume  that  the  angle  9’ 
is  small  then  we  may  assume  that  sin  9’  ~ 9’.  If  we  further  define  the  angle  9 is  the  angle  9’ 
measured  in  units  of  the  characteristic  diffraction  angle  A/D,  then  Eq.  (2.59)  can  be  rewritten  as 

l2 


i(R,e)  = 


Pm^ 


2 Ji(;r6>) 

{Tie) 


(2.60) 


This  function  is  shown  graphically  in  Figure  2-12. 
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Figure  2-12.  The  Airy  function  - diffraction  from  a circular  aperture. 


OFF-AXIS  ANGLE  (units  of  X/D) 


As  indicated  in  Figure- 1 2 and  shown  photographically  in  Figure  2-13,  the  diffraction  pattern 
is  a bright  central  spot  (or  lobe)  surrounded  by  concentric  rings  of  rapidly  decreasing  intensity.  The 
position  of  the  minimum  between  the  central  lobe  and  the  first  ring  is  given  by 

6^^  = 1220)1/ D.  (2.61) 

For  reasons  that  will  be  discussed  in  Chapter  11,  this  angle  is  often  associated  with  the  limiting 
angular  resolution  of  sensors  with  aperture  D and  operating  at  wavelength  1. 

For  values  of  7,  the  Bessel  function  in  Eq.  (2.60)  can  be  approximated  by 

J,(x)  = p/>rx  cos(x-(3;r/4))  (2.62) 

and  the  envelope  of  the  peaks  can  be  determined  by  setting  cos  x = 1.  This  result  shown  here, 
occasionally  finds  utility  in  applications  such  as  determining  the  angular  distance  over  which  a 
bright  source  like  the  sun  might  produce  veiling  glare  in  a sensor,  or  how  fast  the  radar  cross  section 
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of  a circular  disc  falls  off  with  off-normal  reflectance  angle.  The  envelope  function f(  6)  is  explicitly 
illustrated  in  Figure  12-2  and  is  given  by 

\2 
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(2.63) 
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The  intensity  of  the  first  bright  ring  is  seen  to  be  17  dB  (see  Appendix  A on  units  for  a description 
of  logarithmic  units)  below  the  intensity  of  the  central  lobe.  One  must  move  an  angular  distance  at 
least  9.36  A/D  before  the  sidelobe  intensity  falls  below  -40  dB. 


The  diffraction  pattern  from  a rectangular  aperture  of  horizontal  side  2a  and  vertical  side  2b 
is  given  by  the  function 


I{R,0,^)  = 


PAab 

^ sin(;T(2fl  / A)  sin  0^  ^ 

( sm(;r(2Z)//l)sin  (fy 

^ 7v{2a ! X)^m0  ^ 

^ 7v{2b ! A)^in^  ^ 

(2.64) 


where  6>is  the  angle  with  respect  to  the  horizontal  and  the  angle  with  respect  to  the  vertical.  We 
will  see  the  function  {(sin  x)/x  = sine  x}  time  and  again.  It  will  be  especially  prominent  when  we 
discuss  radar  cross  sections.  A photograph  of  the  diffraction  pattern  from  a rectangular  aperture  is 
compared  with  the  diffraction  pattern  from  a circular  aperture  in  Figure  2-13. 


Figure  2-13.  Photographs  of  the  diffraction  patterns  from  a 
a)  circular  aperture  and  b)  rectangular  aperture.  [2] 

a)  b) 
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Mie  Scattering 

Mie  scattering  is  the  scattering  of  electromagnetic  radiation  by  spherical  particles  having  a 
dielectric  constant  and/or  a conductivity  which  differs  from  the  medium  surrounding  the  particle  [4] . 
Calculation  of  this  phenomena  is  fairly  complex  and  will  not  be  described  here.  However,  the 
results  of  such  calculations  are  extremely  interesting  and  can  be  qualitatively  applied  to  understand- 
ing many  phenomena  encountered  in  the  propagation  of  electromagnetic  radiation.  First,  we  must 
understand  the  concept  of  cross  section.  Assume  that  a plane  wave  of  radiation  is  incident  on  a unit 
area.  If  we  place  a scatterer  (an  object,  or  in  the  Mie  scattering  problem,  a small  spherical  particle), 
then  part  of  the  radiation  will  be  blocked  by  the  particle.  This  blocked  radiation  is  either  absorbed 
or  it  is  scattered  into  all  directions.  Due  to  diffraction,  a particle  may  actually  block  more  or  less 
area  than  the  physical  cross  sectional  area  of  the  particle.  The  scattering  cross  section  of  an  object 
is  the  apparent  (as  opposed  to  physical)  size  of  the  object  based  on  the  fraction  of  the  incident 
radiation  that  is  affected.  Let  the  unit  area  be  1 cm^.  If  the  physical  cross  sectional  area  of  the 
sphere  is  na^  = 0.0001  cm^  but  scatters  1%  of  the  incident  radiation,  then  the  total  scattering  cross 
section  is  0.01  x 1 cm^  = 0.01  cm^  (100  times  the  real  physical  cross  section).  If  the  fractional 
scattered  radiation  is  measured  as  a function  of  angle,  it  is  possible  to  determine  a differential  cross 
section  (usually  given  units  of  cmVsr). 

The  most  commonly  presented  Mie  scattering  result  is  a graph  of  the  total  scattering  cross 
section  as  a function  of  wavelength  of  the  scattered  radiation.  Figure  2-14  shows  the  Mie  scattering 
cross  section  of  a perfectly  conducting  metallic  sphere  of  radius  a.  The  calculated  curve  is 
normalized  to  the  physical  spherical  cross  section  The  abscissa  of  the  plot  is  ka  = 2m/X.  There 
are  three  obviously  different  physical  regions.  When  ka  > 20,  that  is,  when  the  wavelength  is  much 
smaller  than  the  particle  radius,  the  scattering  cross  section  is  exactly  equal  to  the  physical  cross 
section.  This  is  usually  referred  to  as  the  optical  region.  Diffraction  is  a minor  contributor  in  the 
optical  region. 


Figure  2-14.  Normalized  Mie  scattering  cross  section  of  a perfectly  conducting  metallic  sphere. 
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When  ka  < 0.5,  that  is  when  the  wavelength  is  much  larger  than  the  particle  radius,  the  scattering 
cross  section  falls  off  as  the  inverse  fourth  power  of  the  wavelength  (a  a This  region  is  known 

as  the  Rayleigh  region.  In  between  the  Rayleigh  and  optical  regions,  the  cross  section  oscillates  up 
and  down  about  the  optical  value  m^.  This  region  is  called  the  resonance  region.  The  oscillations 
are  due  to  interferences  between  waves  which  are  scattered  from  the  front  surface  of  the  sphere  and 
waves  which  are  diffracted  completely  around  the  sphere. 

This  qualitative  shape  of  the  Mie  curve  (Rayleigh  region,  resonance  region,  and  optical 
region)  will  be  valid  for  virtually  any  spherical  particle.  However,  the  exact  shapes  for  weakly 
conducting,  non-conducting  (dielectric),  and/or  absorbing  particles  will  vary  depending  on  the  real 
and  imaginary  parts  of  the  refractive  index  and  the  conductivity.  The  strength  and  number  of 
oscillations  are  strongly  affected  by  the  material  characteristics.  The  presence  of  a strong  absorption 
in  a region  of  the  spectrum  will  cause  significant  dips  in  the  cross  section  in  that  region.  Further- 
more, clouds  of  particles  will  have  particle  of  different  radii.  In  this  case  the  appropriate  Mie  curve 
must  be  averaged  over  the  distribution  in  sizes  (radius  a).  The  resulting  averaged  curve  will  usually 
not  exhibit  significant  resonance  oscillations  (larger  particles  will  have  their  scattering  peaks  at  the 
minima  of  smaller  particles).  However,  unless  the  range  of  particle  sizes  extends  over  many  orders 
of  magnitude,  the  average  curve  will  still  have  pronounced  Rayleigh  and  optical  regions  with  a 
smooth  shoulder  replacing  the  resonance  region.  Even  non-spherical  particles  will  have  scattering 
curves  that  possess  Rayleigh  regions,  optical  regions,  and  resonance  regions.  However,  the  non- 
spherical  particle  curves  will  exhibit  significant  polarization  dependences  and  have  different  angular 
distributions  of  scattered  radiation.  Even  complex  objects  such  as  ships  or  airplanes  will  have  a flat 
region  in  a <t  versus  HX  plot  that  corresponds  to  an  optical  region  (wavelength  small  compared  to 
any  characteristic  scale  length  of  the  target)  and  to  region  that  corresponds  to  the  Rayleigh  region 

(wavelength  large  compared  to  the  maximum  dimension  of  the  target). 

The  virtual  indifference  of  the  qualitative  Mie  curve  to  external  conditions  allows  its  use  in 
many  different  situations.  We  shall  see  that  it  can  be  used  to  describe  the  attenuation  of  electromag- 
netic radiation  by  rainfall  and  by  clouds  and  fogs  from  the  microwave  region  through  the  visible 
region.  It  can  be  used  to  explain  why  the  sky  is  blue.  Air  molecules  act  as  dielectric  particles  that 
are  small  compared  to  visible  light.  Thus  scattering  occurs  in  the  Rayleigh  region.  Blue  light  (small 
wavelengths)  from  the  sun  is  thus  more  strongly  scattered  than  red  light  (large  wavelengths).  Mie 
scattering  can  also  explain  why  most  military  smokes  are  not  very  effective  in  the  infrared.  We  will 
go  into  more  detail  on  these  and  other  phenomena  later  in  this  volume. 
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Problems 


2- 1 . From  what  does  the  existence  of  electromagnetic  radiation  follow  as  an  inevitable  outcome? 

2-2.  What  functional  form  do  solutions  of  the  equation 


1 

dx^  dt^ 


(2.65) 


take?  The  quantity  (^represents  any  of  a number  of  real  physical  quantities  which  will  not 
be  delineated  here.  Equation  (2.65)  is  one  of  the  classic  equations  of  physics.  What  name 
is  given  to  equations  of  the  form  of  Eq.  (2.65)? 

2-3.  You  need  to  find  a window  material  for  a dual  mode  seeker  operating  at  10  pm  and  at  95 
GHz.  Gallium  Arsenide  has  a complex  refractive  index  of  3.277  + i6. 1x10'^  at  9.6  pm  and 
3.606  + i0.0012  at  95  GHz.  What  is  the  absorption  coefficient  of  gallium  arsenide  at  9.6 
pm?  At  95  GHz?  What  is  the  loss  tangent  at  95  GHz.  Windows  are  nominally  1 cm  thick. 
From  an  absorption  perspective  only  is  gallium  arsenide  likely  to  be  a good  window 
material. 

2-4.  The  refractive  indices  before  and  after  an  interface  between  two  media  are  n,  and  ri2, 
respectively.  If  the  angle  of  incidence  is  45°,  derive  an  expression  for  the  reflection 
coefficient  (£'//£'/). 

2-5.  What  theoretical  construct  is  used  to  derive  diffraction  and  interference  relations? 

2-6.  Two  point  sources  of  electromagnetic  radiation  are  separated  by  a distance  a.  If  the  intensity 

of  one  source  (#1)  is  twice  that  of  the  second  source  (#2),  derive  an  expression  for  the 
interference  pattern  at  an  arbitrary  distance.  Use  the  geometry  of  Figure  2-9. 

2-7.  Using  Huygens’  principle  and  the  empirical  diffraction  patterns  for  the  circular  and 
rectangular  apertures  as  guides,  estimate  (sketch)  the  diffraction  pattern  expected  at  some 
distance  from  a knife  edge.  A knife  edge  can  be  assumed  to  be  an  infinite  half-plane  that 
blocks  radiation  propagation  next  to  an  empty  infinite  half-plane. 

2-8.  A radar  reflecting  target  is  made  from  a spherical,  metallic-coated  balloon.  The  diameter  of 
the  balloon  is  20  cm.  What  is  a good  estimate  of  the  scattering  cross  section  cat  a frequency 
of  300  MHz?  At  a frequency  of  3 GHz?  At  a frequency  of  30  GHz?  Which  of  these 
estimates  is  most  likely  to  be  in  error? 
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CHAPTER  3 


PROPAGATION  OF  ELECTROMAGNETIC  RADIATION.  II. 
- WEATHER  EFFECTS 


Introduction 

The  characteristics  of  the  propagation  medium  have  the  most  profound  influence  on  sensor 
performance.  We  begin  our  studies  with  the  important  case  of  sensors  employing  propagation  of 
electromagnetic  radiation  through  the  atmosphere.  To  fully  understand  the  performance  of  such 
sensors  we  must  understand  the  constituents  of  the  atmosphere,  the  structure  of  the  atmosphere,  and 
how  both  the  constituents  and  structure  varies  with  time  and  location.  This  requires  at  least  a 
minimal  understanding  of  both  meteorology  (weather)  and  climatology. 

The  most  profound  impact  of  the  atmosphere  on  general  sensor  performance  comes  through 
its  ability  to  attenuate  electromagnetic  signals.  The  attenuation  (often  called  extinction)  consists 
of  two  parts:  absorption  and  scattering.  Absorption  is  the  conversion  of  electromagnetic  energy  into 
internal  excitations  of  the  atmospheric  constituents  or  into  heat.  Scattering  is  the  alteration  of  the 
direction  of  propagation  of  a unit  of  electromagnetic  energy  without  any  significant  reduction  in  the 
magnitude  of  that  energy.  Extinction  (both  absorption  and  scattering)  tends  to  reduce  the  strength 
of  signals  that  sensors  are  trying  to  detect.  Scattering  (especially  of  other  sources  of  electromagnetic 
radiation)  tends  to  affect  the  noise  that  tends  to  make  detection  of  the  signal  more  difficult. 
Therefore  both  scattering  and  absorption  are  important  to  electromagnetic  sensor  systems.  It  is 
important  to  remember  that 

EXTINCTION  = SCATTERING  + ABSORPTION 
Knowledge  of  two  of  the  three  components  allows  calculation  of  the  third. 

Before  proceeding  to  address  the  details  of  extinction,  it  is  useful  to  introduce  a concept  that 
has  great  practical  utility  and  is  a baseline  input  to  many  extinction  calculation.  Visibility  is  the 
greatest  range  at  which  an  unaided  human  observer  can  detect  objects  of  interest.  It  has  two  working 
definitions.  In  daytime,  visibility  is  the  greatest  range  at  which  a very  dark  object  can  be  detected 
against  the  sky  at  the  horizon.  At  night,  visibility  is  the  greatest  range  at  which  an  unfocused, 
moderately  intense  light  source  can  be  detected.  In  practice,  visibility  is  estimated  by  looking  at 
appropriate  objects  with  known  ranges.  The  visibility  is  somewhere  between  the  range  of  the 
farthest  object  you  can  see  and  the  closest  object  you  can’t  see. 

To  place  visibility  into  a more  scientific  perspective,  the  meteorological  community  defined 
a related  term,  the  meteorological  range  (or  sometimes  the  meteorological  visibility  range),  V. 
It  is  defined  by  the  Koschmieder  formula  [I] 
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(3.1) 


F = ilnl  = ^ 

pep 

where  /?is  the  scattering  coefficient  at  550  nm  wavelength  and  ^•is  a threshold  contrast  for  detection. 
For  human  visual  detection  it  has  been  found  that  sis  approximately  0.02.  For  most  aerosols  in  the 
visible,  absorption  is  negligible  and  p is  usually  approximated  by  the  extinction  coefficient. 

Visibility  is  a subjective  measure;  meteorological  range  is  quantitative.  Visibility  is  easily 
measured  without  expensive  instruments  and  is  reported  in  routine  weather  measurements. 
Meteorological  range  requires  expensive  instruments  to  measure.  It  is  commonly  used  as  an  input 
into  atmospheric  propagation  computer  codes.  The  two  are  often  (and  incorrectly)  used  interchange- 
ably. If  only  an  observed  visibility,  is  available,  the  meteorological  range  can  be  estimated  by 

V = (1.3  ± 0.3)  X . (3.2) 

Meteorological  range  has  been  grouped  into  10  international  visibility  codes  shown  in  Table  3-1. 

[2] 


Table  3-1.  International  Visibility  Codes 


CODE 

NUMBER 

WEATHER 

CONDITION 

METEOROLOGICAL 
RANGE  (km) 

SCATTERING 
COEFFICIENT  (km  b 

0 

Dense  Fog 

< 0.05 

> 78.2 

1 

Thick  Fog 

0.05  - 0.20 

78.2  - 19.6 

2 

Moderate  Fog 

0.2  - 0.5 

19.6  - 7.82 

3 

Light  Fog 

0.5  - 1.0 

7.82  - 3.91 

4 

Thin  Fog 

q 

CN 

q 

3.91  - 1.96 

5 

Haze 

2.0  - 4.0 

1.96  - 0.954 

6 

Light  Haze 

4.0  - 10.0 

0.954  - 0.391 

7 

Clear 

10.0  - 20.0 

0.391  - 0.196 

8 

Very  Clear 

20.0  - 50.0 

0.196  - 0.078 

9 

Exceptionally  Clear 

> 50.0 

< 0.078 



Pure  Air 

277 

0.0141 
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Many  atmospheric  constituents  contribute  to  atmospheric  scattering  and/or  absorption. 
These  are  summarized  in  Table  3-2.  Molecules  are  extremely  small  “particles”.  It  is  not  unreason- 
able to  expect  that  since  the  characteristic  dimensions  “a”  are  very  small  (atmospheric  molecules 
are  of  the  order  of  one  nanometer  in  dimension)  compared  to  the  wavelength  (hundreds  of 
nanometers),  that  molecules  would  scatter  radiation  similar  to  the  Rayleigh  region  of  the  Mie 
scattering  curve.  This  is  indeed  observed  and  is  called  Rayleigh  scattering.  This  is  a very  weak 
effect.  The  extinction  coefficient  for  Rayleigh  scattering  at  550  nm  in  a sea-level  atmosphere  is  only 
0.0141  km  '.  The  corresponding  visibility  is  277  km.  All  molecules  contribute  to  Rayleigh 
scattering.  The  sky  is  blue  because  of  Rayleigh  scattering.  Due  to  the  l/T"'  dependence  of  the 
scattering  strength,  blue  light  at  400  nm  is  scattered  13  times  more  strongly  than  deep  red  light  at 
760  nm.  Since  the  scattering  during  the  middle  of  the  day  (when  the  sun  is  well  above  the  horizon) 
is  not  strong  enough  to  completely  attenuate  the  blue  component,  everywhere  we  look  there  is  an 
equally  strong  blue  excess  over  red.  Sunsets  tend  to  be  red,  because  at  the  low  grazing  angles  at 
sunset  allow  a propagation  path  long  enough  for  all  of  the  blue  light  to  be  attenuated.  Only  the  red 
light  remains. 

Some  molecules  contribute  to  absorption.  It  is  useful  to  distinguish  between  different  kinds 
of  atmospheric  molecules.  Seven  gases  (O2,  Nj,  CO2,  Ar,  CH4,  N2O,  and  CO)  constitute  the 
uniformly  mixed  gases.  Uniformly  mixed  means  that  the  relative  proportions  of  each  gas  are 
essentially  independent  of  altitude  or  geographic  location.  Of  the  uniformly  mixed  gases,  two  have 
very  significant  absorptions.  Oxygen  (O2)  has  two  strong  absorption  lines  in  the  microwave  region 
and  one  strong  absorption  line  at  the  red  edge  of  the  visible  region.  Carbon  dioxide  (CO2)  absorbs 
at  a large  number  of  wavelengths  throughout  the  infrared  region. 

Two  abundant  gases  are  not  uniformly  mixed.  As  we  shall  see,  water  vapor  (H2O)  is 
concentrated  near  the  earth’s  surface,  while  ozone  has  a maximum  concentration  in  the  stratosphere. 
Water  vapor  absorbs  at  many  places  throughout  the  infrared  and  microwave  regions.  Ozone  has 
limited  absorption  in  the  infrared,  but  is  an  intense  absorber  of  ultraviolet  radiation  in  the  region 
from  200-300  nm.  The  significance  of  the  absorption  of  both  of  these  molecules  will  be  discussed 
in  detail  later. 

There  are  many  trace  gases  (twenty  of  which  are  of  sufficient  significance  to  warrant 
inclusion  in  atmospheric  extinction  models  and  are  explicitly  named  in  the  table).  Many  of  these 
trace  gases  are  the  result  of  natural  processes:  photochemical  degradation  of  organic  chemical 
vapors  emitted  by  plants,  decay  and  decomposition  of  dead  organic  matter,  emission  from  volcanic 
(or  other  geothermal)  events,  and  production  by  lightning  or  solar  wind  radiation  interaction  with 
more  abundant  atmospheric  species.  A few  are  characteristically  human  industrial  pollutants.  None 
of  the  trace  gases  have  absorption  strong  enough  to  influence  any  sensor  that  isn’t  designed  to 
explicitly  detect  those  trace  gases.  The  same  is  true  of  artificial  contaminant  vapors  (such  as 
chemical  warfare  agents  or  HAZMAT). 

In  the  atmosphere  there  is  a special  type  of  molecular  absorption  that  is  not  commonly 
encountered  elsewhere,  the  so-called  continuum  absorption.  Continuum  absorption  results  when 
collisions  between  molecular  species  alter  the  energy  levels  of  those  molecules  in  such  a way  that 
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Table  3-2.  Potential  contributors  to  atmospheric  extinction. 


MOLECULAR  SCATTERING  (RAYLEIGH  SCATTERING) 
MOLECULAR  ABSORPTION 


- UNIFORMLY-MIXED  GASES 

- WATER  VAPOR  (HjO) 

- OZONE  (O3) 

- 0, 

CH4 

0 

CS  CN 

z z 

CO2 

CO 

Ar 

- TRACE  GASES 

- NO 

S02 

NO2 

NH3 

HNO3 

OH 

HF 

HCI 

HBr 

HI 

CIO 

OCS 

H2CO 

HOCI 

HCN 

H2O2 

CH3CI 

C2H2 

C2H6 

PH3 

- CONTAMINANTS  - VOLATILE  ORGANIC  COMPOUNDS 


- CHEMICAL  WARFARE  AGENTS 

- POLLUTANTS 

CONTINUUM  ABSORPTION  (COLLISION-INDUCED  ABSORPTIONS) 

- WATER  VAPOR 

- NITROGEN 


PARTICULATE  SCATTERING  AND  ABSORPTION 
- BOUNDARY  LAYER  PARTICULATES  - I 


UPPER  TROPOSHERIC  "AEROSOLS" 
STRATOSPHERIC  "AEROSOLS" 

CLOUDS 


HAZE  AEROSOLS  (RURAL,  URBAN, 
AAARITIME,  DESERT) 

FOG 
ICE  FOG 
DUST 
SMOKE 

ARTIFICIAL  OBSCURANTS 

METEORITIC  DUST 
VOLCANIC  DUST 


PRECIPITATION  (HYDROMETEORS) 

- DRIZZLE 

- RAIN 

- SNOW 

- GRAUPEL 

- HAIL 

- SLEET 
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absorptions  now  occur  where  in  the  absence  of  collision  no  absorptions  would  normally  occur.  As 
two  molecules  collide  and  subsequently  move  apart,  the  energy  levels  morph  continuously  from 
their  isolated  values  to  some  altered  collisional  values  and  back  again.  Thus,  the  collisionally 
altered  absorptions  are  very  much  broader  than  the  absorptions  of  the  isolated  molecules,  hence  the 
use  of  the  adjective  “continuum”.  It  should  be  noted  that  collisions  can  induce  absorptions  where 
no  absorption  is  possible  in  the  isolated  molecule. 

Two  atmospheric  molecules  have  significant  continuum  absorption:  water  vapor  and 
nitrogen.  Water  molecules  interact  strongly  with  other  water  molecules,  but  interact  only  weakly 
with  nitrogen  and  oxygen.  Thus,  water-water  collisions  have  a strong  collisional  absorption.  Since 
the  rate  at  which  water  molecules  collide  with  other  water  molecules  is  proportional  to  the  square 
of  the  partial  pressure  (molecular  density)  of  water  vapor,  one  would  expect  that  the  importance  of 
continuum  absorption  to  be  much  higher  when  the  partial  pressure  of  water  vapor  is  high  (humid, 
sea  level)  than  when  it  is  low  (dry,  moderate  altitudes  and  above).  This  is  indeed  observed.  The 
strong  collisional  dependence  of  absorption  of  water  dramatically  manifests  itself  in  the  absorption 
of  liquid  water.  We  have  already  seen  that  liquid  water  is  virtually  opaque  everywhere  except  for 
a narrow  window  in  the  blue-green  region.  Nitrogen  is  a molecule  that  at  low  pressure  has  no 
visible,  infrared,  or  microwave  absorption.  However,  there  is  a collision-induced  absorption 
between  4 pm  and  5 pm  that  must  be  incorporated  in  predictions  of  infrared  attenuation. 

Isolated  molecules  are  not  the  only  constituents  of  the  atmosphere.  Particulates  can  be  found 
at  all  altitudes  and  geographic  locations.  Particulates  small  enough  that  they  are  at  least  partially 
subject  to  Brownian  motion  and  remain  airborne  for  extended  periods  of  time  are  often  referred  to 
as  aerosols.  Aerosol  has  a technical  definition  meaning  an  airborne  suspension  of  a liquid  or  solid. 
Atmospheric  aerosols  may  contribute  to  absorption  in  some  spectral  regions  (depending  on  their 
chemical  constitution)  and  will  contribute  to  scattering  in  all  regions  with  wavelengths  comparable 
to  or  smaller  than  the  particle  size.  The  scattering  behavior  can  usually  be  described  as  “Mie-like”. 

In  the  lower  few  kilometers  of  the  atmosphere  (the  boundary  layer)  we  encounter  haze 
aerosols,  fogs,  smoke,  dust,  and  artificial  obscurants  (military  smokes).  At  higher  altitudes  there  are 
clouds  and  volcanic  dust.  At  very  high  altitudes  there  is  a steady  influx  of  meteoritic  dust.  Hazes 
generally  produce  visibilities  of  a few  kilometers.  Fogs  can  produce  visibilities  of  100  m or  less, 
as  can  clouds.  Smoke  and  dust  from  distant  sources  can  be  so  thin  as  to  produce  only  mild 
reductions  in  visibility  or  so  dense  that  visibilities  are  measured  in  single-digit  meters.  Even 
volcanic  dust  has  high  variability.  Those  who  remember  the  eruption  of  Mt.  St.  Helens  in 
Washington  or  Mt.  Pinatubo  in  the  Philippines  both  of  which  dumped  inches  of  falling  ash  over 
thousands  of  square  miles  of  territory,  can  attest  to  the  worst  cases. 

A final  contributor  to  atmospheric  extinction  is  the  class  of  hydrometeors.  Hydrometeor 
means  “water  in  the  air”.  Technically,  it  includes  fogs  and  clouds.  However,  from  a practical 
perspective  the  term  is  often  associated  with  “falling  water”  or  precipitation.  There  is  drizzle  (liquid 
water  with  drop  sizes  less  than  0.5  mm),  rain  (liquid  water  with  drop  sized  greater  than  0.5  mm), 
snow  (crystalline  solid  water),  hail  (large  balls  of  frozen  water  formed  by  rain  being  cycled  up  and 
down  between  high  liquid  content  regions  and  freezing  regions  of  a cloud),  graupel  (small  pellets 
of  snow  formed  by  partial  melting  and  refreezing  of  snow),  sleet  (mixed  rain  and  snow),  among 
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many  other  forms.  The  primary  characteristic  of  hydrometeors  is  their  large  size  (they  tend  to  be 
large  enough  to  fall  with  a steady  terminal  velocity).  As  a result  their  effects  extend  much  farther 
into  the  microwave  region  than  aerosol  effects.  The  magnitudes  of  hydrometeor  effects  depend 
strongly  on  the  quantity  of  water  (converted  to  an  equivalent  liquid  water  precipitation  rate)  that  falls 
and  on  the  type  of  hydrometeor  (rain  does  not  produce  as  large  an  effect  as  does  snow  for  equivalent 
liquid  water  precipitation  rate). 
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Structure  of  the  Atmosphere 


The  Earth’s  atmosphere  is  not  uniform  in  either  space  or  time.  There  are  very  distinct 
variations  in  atmospheric  characteristics  as  one  moves  from  one  pole  towards  the  equator  and  on  to 
the  other  pole.  There  are  variations  as  one  moves  from  coastal  regions  to  interior  regions  of  the 
continents  and  as  one  moves  from  lowlands  to  mountainous  areas.  There  are  major  variations  as  the 
seasons  change  from  summer  to  fall  to  winter  to  spring  and  back  to  summer  again.  The  foremost 
atmospheric  attenuation  propagation  code,  LOWTRAN,  has  defined  six  atmospheric  models  that 
cover  several  of  regions  of  interest  at  their  maximum  seasonal  differences.  [3]-[ll]  These  are 
summarized  in  Table  3-3.  These  six  models  will  show  up  in  virtually  all  of  the  charts  that  follow 
in  this  section. 


Table  3-3.  Ground-level  characteristics  of  LOWTRAN  climate  models.[l  1] 


LOW- 

TRAN 

MODEL 

NO. 

GEOGRAPHICAL 

CLIMATE 

MODEL 

(ATMOSPHERE) 

LOCATION 

AND 

MONTH 

PRESSURE 

(mbar) 

TEMPERATURE 

(K) 

RELATIVE 

HUMIDITY 

(%) 

ABSOLUTE 

HUMIDITY 

(g/m*3) 

WIND 

SPEED 

(m/a) 

1 

TROPICAL 

16*N,  ANNUAL  AVG. 

1013.0 

299.7 

76.63 

19.06 

4.1 

2 

MID-LATITUDE 

SUMMER 

46*N.  JULY 

1013.0 

294.2 

76.20 

13.94 

4.1 

3 

MID-LATITUDE 

WINTER 

46*N,  JANUARY 

1018.0 

272.2 

77.13 

3.60 

10.29 

4 

SUB-ARCTIC 

SUMMER 

60*N,  JULY 

1010.0 

287.2 

76.24 

9.10 

6.69 

6 

SUB-ARCTIC 

WINTER 

60*N,  JANUARY 

1013.0 

! 

267.2 

80.63 

1.20 

12.36 

6 

U.S.  STANDARD 

1962  U.S.  STANDARD 
ATMOSPHERE 

1013.0 

288.2 

46.89 

6.87 

7.2 

Figure  3-1  show  the  variation  of  temperature  with  altitude  over  the  range  0-120  km  for  the 
six  LOWTRAN  climate  models.  All  of  the  models  exhibit  the  same  qualitative  behavior.  From  sea 
level  the  temperature  falls  steadily  with  increasing  altitude  until  roughly  10  km  (possibly  as  high  as 
20  km)  altitude.  There  may  or  may  not  be  a thin  layer  in  which  the  temperature  is  virtually  constant 
with  increasing  altitude.  Next  the  nominal  downward  trend  reverses  and  the  temperature  increases 
with  increasing  altitude  until  roughly  50  km.  The  trend  reverses  a second  time  and  the  temperature 
falls  again  with  increasing  altitude  until  roughly  85  km  at  which  point  the  trend  reverses  for  a third 
time.  The  temperature  then  increases  rapidly  with  altitude  reaching  extremely  high  values  at 
altitudes  above  120  km. 
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Figure  3-1.  Atmospheric  temperature  as  a function  of  altitude.  The  divisions  of 
the  atmosphere  derive  from  the  characteristic  temperature  variations.[ll] 


TROPOPAUSE 


ALTITUDE  km 


TRP 

MLS 
^ MLW 
e-  SAS 
^ SAW 
^ STD 


It  should  be  noted  that  the  term  “temperature”  used  here  is  based  on  the  average  kinetic 
energies  of  the  particles.  At  each  altitude  the  particles  are  assumed  to  be  in  local  thermodynamic 
equilibrium.  The  particle  speeds  are  thus  given  by  the  Maxwell  speed  distribution  law 


p{y)  = Ati 


M 


,3/2 


271  kT 


^1^-Myp-HkT 


(3.3) 


where  M is  the  mass  of  the  particle  and  v is  its  speed.  If  is  the  average  particle  velocity,  then  the 
temperature  can  be  defined  as 

T - Ti:  Mv\^  !%k-  71 / Ik  (3.4) 

where  = MvJ/2  is  the  average  particle  kinetic  energy.  The  human  sensation  of  temperature 
depends  not  only  on  the  average  particle  energy,  but  also  on  the  rate  of  energy  transfer  to/from  our 
bodies.  This  depends  on  the  density  of  particles  (pressure)  as  well.  Even  a very  hot  “temperature” 
would  feel  cold  to  our  skin  at  the  low  pressures  of  the  upper  altitudes. 
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The  temperature  dependence  described  above  has  significance  because  it  form  the  basis  for 
dividing  the  atmosphere  into  discrete  layers  (as  indicated  very  approximately  in  Figure  3-1).  The 
lowest  layer  is  the  troposphere.  The  temperature  variations  that  commonly  occur  in  the  troposphere 
are  shown  in  Figure  3-2.  The  average  tropospheric  decrease  in  temperature  is  6.5  K per  km.  This 
is  true  over  most  of  the  earth.  However,  as  indicated  by  the  curve  for  sub-arctic  winter,  there  is 
usually  a thin  (1-2  km)  inversion  layer  over  snow-covered  ground.  It  should  be  noted  that  Figures 
3-1  through  3-9  in  this  section  were  plotted  using  data  extracted  directly  from  the  LOWTRAN7 
source  code.[l  1] 

The  troposphere  extends  from  sea  level  to  the  tropopause  at  an  altitude  typically  between  10 
and  20  km.  The  tropopause  is  defined  as  the  altitude  at  which  the  decrease  in  temperature  ceases 
and  is  replaced  by  either  an  inversion  (increase  in  temperature  with  altitude)  or  by  a neutral  layer 
(temperature  constant  with  altitude).  The  tropopause  marks  the  lower  edge  of  the  stratosphere,  the 
second  layer  extended  from  the  tropopause  to  roughly  50  km  altitude.  The  top  of  the  stratosphere 
is  called  the  stratopause  and  is  the  point  at  which  temperature  inversion  ceases  and  the  decrease  in 
temperature  with  altitude  increase  resumes.  Above  the  stratosphere  is  the  layer  referred  to  as  the 
mesosphere  that  extends  from  the  stratopause  to  roughly  80-90  km.  The  top  of  the  mesosphere  is 
the  mesopause  which  is  the  altitude  at  which  the  temperature  inversion  resumes.  Above  the 
mesopause  is  the  outer  layer  of  the  atmosphere  called  the  thermosphere  or  the  ionosphere. 

Distinctly  different  phenomena  are  associated  with  each  layer.  Most  of  the  phenomena  that 
are  associated  with  weather  occur  in  the  troposphere.  Virtually  all  fogs,  clouds,  and  precipitation, 
as  well  as  all  forms  of  wind  storms  (cyclones,  anti-cyclones,  and  tornados)  occur  in  the  troposphere. 
Winds  in  the  troposphere  tend  to  increase  as  altitude  increases.  The  jet  streams  are  narrow  bands 
of  extremely  strong  winds  that  exist  at  high  altitudes  below  the  tropopause.  Except  in  the  vicinity 
of  the  jet  streams  and  large  cyclonic  storms,  the  temperature  inversion  tends  to  prevent  the 
troposphere  and  the  stratosphere  from  having  much  interaction  or  exchange  of  materials.  Most 
phenomena  that  affect  sensor  performance  exist  in  the  troposphere.  However,  a few  important  ones 
do  not.  As  we  shall  see  shortly,  most  of  the  “ozone  layer”  lies  within  the  stratosphere.  The  same 
is  true  of  an  occasionally  significant  dust  layer  produced  by  volcanic  eruptions.  The  ionosphere,  by 
virtue  of  having  a significant  ionization,  will  affect  the  propagation  of  radio  waves.  The  increase 
and  decrease  of  this  ionization  as  the  sun  rises  and  sets  will  also  produces  diurnal  changes  in  the 
earth’s  magnetic  fields. 

The  temperature  decrease  with  increasing  altitude  in  the  troposphere  is  called  the  lapse  rate 
[12].  Any  increase  in  temperature  with  increasing  altitude  has  an  inverted  lapse  rate  and  is  called 
a temperature  inversion.  The  lapse  rate  is  whatever  temperature  profile  actually  exists.  The  global 
average  lapse  rate  is  roughly  6.4  °C  per  km  altitude  (2  °C  per  1000  feet).  It  has  no  direct  connection 
to  the  adiabatic  lapse  rate.  The  adiabatic  lapse  rate  is  the  change  in  temperature  that  a mass  of  air 
would  experience  as  it  increased  in  altitude  without  being  able  to  exchange  heat  with  its  surround- 
ings. Heating  or  cooling  depends  only  on  the  expansion  or  contraction  of  the  gas  as  the  altitude 
changes. 
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Figure  3-2.  Temperature  dependence  of  the  troposphere.[l  1] 


ALTITUDE  km 


TRP 

MLS 
^ MLW 
SAS 
^ SAW 
STD 


The  adiabatic  lapse  rate  is  roughly  9 °C  per  kilometer  (3  °C  per  1000  feet)  for  dry  air.  The 
atmospheric  lapse  rate  is  usually  less  than  the  adiabatic  rate  because  the  atmosphere  is  constantly 
in  motion.  Turbulence,  precipitation,  and  convection  all  lead  to  mixing  of  energy  content  from  one 
altitude  to  another.  Such  mixing  processes  violate  the  adiabatic  assumption.  The  adiabatic  lapse 
rate  is  the  temperature  change  in  a volume  of  gas  brought  about  by  vertical  motion.  If  the  adiabatic 
lapse  rate  is  greater  than  the  actual  lapse  rate  (for  example,  T^j^  = Tg^g  = 30  °C  at  sea  level,  T^j^  = 24 
“C  and  Tgj,3  = 21  “C  at  1000m),  a rising  volume  of  hot  air  will  cool  faster  than  the  surrounding  air. 
Its  buoyancy  will  therefore  be  less  than  the  surrounding  air  and  the  air  volume  will  fall.  This  is  a 
stable  atmospheric  condition.  If  the  adiabatic  lapse  rate  is  less  than  the  lapse  rate  of  the  surrounding 
air  (for  example,  T^j^  = Tgj,^  = 30  °C  at  sea  level,  T^,;^  = 18  °C  and  Tg^g  = 21  “C  at  1000m),  a rising 
volume  of  air  will  cool  slower  than  the  surrounding  air.  Its  buoyancy  will  be  higher  than  the 
surrounding  air,  and  it  will  continue  to  rise.  This  is  an  unstable  condition.  The  atmosphere  is  stable 
on  the  average.  However,  solar  heating  of  the  earth’s  surface  (and  the  air  in  contact  with  it)  or  the 
forced  flow  of  cold  air  masses  over  warmer  air  masses  can  both  produce  unstable  atmospheric 
condition.  Unstable  conditions  affects  the  local  weather  and,  as  we  shall  see  later,  they  affect  the 
employment  of  military  obscurants  (smoke)  and  chemical  and  biological  agents. 

The  pressure  variation  with  altitude  is  shown  in  Figure  3-3.  Pressure  does  not  exhibit  much 
variation  according  to  locale  and  tends  to  fall  off  exponentially  with  altitude.  The  exponential  scale 
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Figure  3-3.  Atmospheric  pressure  as  a function  of  altitude.  [1 1] 
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height  is  roughly  6.5  km.  That  is,  the  pressure  falls  off  as 

p[H  in  km)  = (3.5) 

His  the  altitude.  The  pressure  at  15  km  is  roughly  10%  of  the  pressure  at  sea  level.  It  can  be  safely 
assumed  that  the  uniformly-mixed  gases  (O2,  N2,  CO2,  etc.)  maintain  their  relative  abundances  at 
altitudes  up  to  80  km  (the  bottom  of  the  ionosphere).  Their  absolute  abundances  will  scale 
exponentially  with  altitude  exactly  as  the  pressure. 

Several  other  important  species  do  not  vary  in  such  well-behaved  fashion.  Figures  3-4  and 
3-5  show  the  altitude  dependence  of  water  vapor  concentration.  In  the  troposphere  water  vapor 
decreases  by  four  orders  of  magnitude  as  the  altitude  varies  from  sea  level  to  only  12  km.  That  is, 
99.99%  of  all  atmospheric  water  vapor  lies  below  12  km  altitude;  90%  lies  below  3 km  altitude. 
The  implication  of  this  is  that  if  a sensor  has  its  performance  limited  by  water  vapor  for  sea  level 
operation,  then  the  performance  of  that  sensor  at  high  altitudes  may  not  suffer  the  same  limitations 
- there  is  no  water  vapor  at  high  altitudes.  The  same  sensor  operated  at  sea  level  may  be  useless 
against  surface  targets,  but  have  acceptable  performance  against  high  altitude  targets  because  of  the 
differences  in  integrated  concentration  along  the  slant  path  versus  the  horizontal  path. 

Another  molecular  species  whose  concentration  does  not  exponentially  decrease  with 
increasing  altitude  is  ozone.  The  altitude  dependence  of  ozone  concentration  is  shown  in  Figures 
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3-6  and  3-7.  There  is  a significant  concentration  of  ozone  at  sea  level.  Some  but  not  all  of  this  is 
man-made.  A considerable  amount  is  produced  by  atmospheric  electrical  activity.  The  majority, 
however,  is  produced  in  a thick  layer  surrounding  the  tropopause.  This  layer  is  commonly  referred 
to  as  the  ozone  layer.  The  ozone  layer  is  roughly  10  km  thick  and  centered  around  20  km  altitude. 
In  the  ozone  layer,  solar  ultraviolet  radiation  breaks  atmospheric  molecular  oxygen  into  oxygen 
atoms.  When  one  of  these  atoms  collides  with  an  oxygen  molecule,  an  ozone  molecule  is  formed. 
Ozone  has  the  useful  property  of  strongly  absorbing  ultraviolet  radiation  between  200  nm  and  300 
nm.  Such  radiation  is  extremely  harmful  to  life  causing  cell  death  and  genetic  mutations.  It  is  also 
a potent  skin  carcinogen.  Without  this  layer,  life  on  the  earth’s  surface  may  not  be  possible.  The 
absorption  of  solar  ultraviolet  radiation  in  the  stratosphere  is  a major  contributor  to  the  temperature 
inversion  that  is  characteristic  of  this  layer  of  the  atmosphere.  A side  effect  of  the  strong  ultraviolet 
absorption  by  the  ozone  layer,  is  the  creation  of  a “solar  blind’’  spectral  region  from  200-300  nm  at 
sea  level.  The  solar  absorption  is  so  strong  that  the  world  is  pitch  black  in  this  spectral  region.  Only 
fires  and  extremely  hot  objects  emit  any  detectable  amount  of  radiation  in  this  region. 

Another  class  of  atmospheric  species  that  is  non-uniform  in  its  altitude  dependence  is  the 
class  of  atmospheric  aerosols.  The  altitude  dependence  of  the  major  background  aerosols  is  shown 
for  the  fall  and  winter  seasons  in  Figure  3-8  and  the  spring  and  summer  seasons  in  Figure  3-9.  The 
boundary  layer  aerosols  are  confined  almost  entirely  within  the  first  2000  m of  altitude.  It  takes  a 
truly  massive  storm  for  dust,  haze,  and  other  aerosols  to  be  lifted  to  higher  altitudes.  Sensors  that 
are  severely  affected  by  limited  visibility  due  to  high  aerosol  concentrations  at  the  surface  may  work 
very  well  in  almost  every  airborne  application.  The  explicit  dependence  on  visibility  indicates  that 
the  extinction  coefficient  is  a linear  function  of  the  aerosol  concentration.  Above  2000  m there  is 
a background  aerosol  whose  concentrations  fall  off  more  or  less  smoothly.  In  the  stratosphere,  this 
aerosol  is  primarily  residual  volcanic  dust.  In  the  mesosphere  and  above,  the  volcanic  component 
is  gradually  replaced  by  meteoritic  dust.  Right  in  the  vicinity  of  the  tropopause,  there  is  frequently 
a relatively  thick  layer  of  volcanic  dust  caused  by  recent  (within  months  to  a few  years  at  most) 
eruptions.  If  an  eruption  is  violent  enough  to  cause  much  of  its  ash  to  reach  the  tropopause,  the  jet 
stream  will  blow  it  into  a band  that  circles  the  world  and  gradually  spreads  out.  The  strength  of  this 
layer  can  vary  by  two  or  even  three  orders  of  magnitude  above  the  background  levels.  Over  time, 
the  volcano  dust  will  gradually  settle  out. 

With  the  exception  of  rare  nacreous  clouds  and  noctilucent  clouds,  clouds  are  a strictly 
tropospheric  phenomenon.  Most  clouds  are  restricted  to  thin  layers  of  the  atmosphere  (typically  a 
few  hundred  meters  to  a few  kilometers  thick).  Fogs  are  clouds  that  form  at  the  surface  and  extend 
upwards  a few  meters  to  a few  hundred  meters.  Clouds  that  do  not  contact  the  ground  tend  to  form 
in  three  regions:  low  (0.5-2  km  cloud  base),  medium  (2  -7  km  cloud  base),  and  high  (5- 1 3 km  cloud 
base).  All  three  types  of  clouds  may  be  present  at  the  same  time.  Only  cumulus,  stratocumulus,  and 
cumulonimbus  clouds  tend  to  be  more  than  a few  kilometers  thick.  The  tops  of  cumulonimbus 
clouds  may  extend  slightly  into  the  stratosphere.  Nacreous  clouds  occur  high  in  the  stratosphere. 
Having  an  iridescent  “mother  of  pearl’’  appearance,  nacreous  clouds  appear  to  be  associated  with 
air  masses  forced  violently  upward  by  high  mountain  ranges.  Noctilucent  clouds  are  tenuous, 
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Figure  3-4.  Atmospheric  water  vapor  concentration  as  a function  of  altitude.  [1 1] 
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Figure  3-5.  Altitude  dependence  of  tropospheric  water  vapor  concentration.[l  1] 
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e 3-7.  Altitude  dependence  of  tropospheric  and  stratospheric  ozone  concentratior 
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Figure  3-8.  Aerosol  concentrations  as  a function  of  altitude  during  the  fall  and  winter.[l  1] 
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Figure  3-9.  Aerosol  concentrations  as  a function  of  altitude  during  the  spring  and  summer.[l  1] 
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cirrostratus-like  clouds  occurring  at  the  mesopause  (80-90  km).  Their  origin  is  uncertain.  They  can 
be  seen  only  when  directly  illuminated  by  sunlight  against  a twilight  sky  in  the  absence  of 
intervening  lower  clouds. 

Rain  tends  to  be  associated  with  low  or  medium  clouds.  Rain  also  tend  to  come  in  two 
forms:  steady  rains  of  light  to  moderate  intensity  and  intense  rains  of  short  duration.  The  former 
tend  to  occur  from  most  low  and  medium  cloud  types.  The  rain  density  is  zero  at  the  cloud  top  (2- 1 0 
km),  increases  toward  the  cloud  bottom,  and  then  remains  steady  or  slightly  decreasing  (due  to 
potential  droplet  evaporation)  until  the  ground.  The  latter  rain  type  tends  to  be  associated  with  large 
cumulus  or  cumulonimbus  (thunderstorm)  clouds.  Here  the  rain  (or  frozen  rain)  extends  from  near 
the  tropopause  all  the  way  down  to  the  ground).  Snow  has  much  the  same  altitude  characteristics 
as  rain.  Hail  is  only  associated  with  cumulonimbus  clouds  and  will  again  be  present  from  the 
ground  to  near  the  tropopause. 

The  geographic  (latitude  and  longitude)  structure  of  the  atmosphere  is  strongly  tied  to 
whether  an  area  is  over  the  ocean  or  the  land,  whether  it  is  equatorial,  polar  or  in  between,  whether 
it  is  coastal  or  inland,  and  where  it  is  located  with  respect  to  mountain  ranges  and  prevailing  winds 
and  ocean  currents.  There  have  been  a number  of  classifications  of  global  climate  into  regions.  One 
substantial  classification  is  with  respect  to  total  rainfall,  rainfall  rate,  and  latitude.  As  shown  in 
Figure  3-10,  this  classification  scheme  creates  eight  basic  climate  types  based  on  latitude  (and  thus 
range  of  temperatures)  and  total  rainfall.  Subsequent  studies  of  several  of  the  indicated  regions  has 
broken  the  eight  basic  climate  types  into  a number  of  sub-types.  Some  of  this  data  is  included  in  the 
section  on  hydrometeors  later  in  this  chapter. 
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Figure  3-10.  Global  climate  regions. [13] 
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Figure  3-10  shows  a calculation  of  the  atmospheric  extinction  for  a hypothetical  atmosphere 
(r=  293. 15  K,/)  = 1013  mbar,  and  h = 1.5  g/m^  H2O  concentration)  with  no  particulates  of  any  kind. 
[ 1 4]  Molecular  scattering  has  been  ignored,  although  its  inclusion  would  only  produce  a tiny  change 
(<  0.1  dB/km)  in  the  region  to  the  right  of  the  1 pm  tick  mark.  Thus  Figure  3-10  shows  only  the 
molecular  absorption  of  the  atmosphere.  One  thing  is  immediately  apparent.  There  are  two  broad 
regions  where  the  absorption  is  non-existent.  One  is  in  the  radio  frequency  and  microwave  region 
and  the  second  is  in  the  visible.  These  regions  of  low  absorption  are  called  “atmospheric  windows”. 
The  second  observation  is  a region  from  roughly  20  pm  to  500  pm  where  the  attenuation  routinely 
exceeds  100  dB/km.  The  attenuation  is  so  high  that  virtually  no  atmospheric  sensors  are  ever  built 
to  operate  in  this  regime.  A third  observation  is  that  there  are  a number  of  strong  absorption  peaks. 
In  the  microwave  region,  the  absorptions  have  been  identified  as  being  due  to  oxygen  and  water 
vapor.  In  the  infrared  region,  the  absorptions  are  due  mostly  to  carbon  dioxide  and  water  vapor. 
The  deepest  valleys  between  some  of  the  absorption  peaks  are  also  called  atmospheric  windows. 
Prominent  examples  are  near  35  GHz,  95  GHz,  8-14  pm,  4. 5-5. 5 pm,  and  3 -4  pm.  Atmospheric 
windows  play  an  important  role  in  sensor  design;  sensors  are  designed  to  operate  in  atmospheric 


Figure  3-10.  Molecular  absorption  of  the  atmosphere.  [14] 


wQvelength:|  centimeter  j ^millimeter  ^ | ^ub-millimetyj 


infrared 


yisible 


56 


windows.  It  is  unusual  that  any  sensor  designed  to  operate  outside  of  an  atmosphere  window  would 
function  properly  anywhere  in  the  atmosphere.  In  fact,  even  nature  takes  advantage  of  atmospheric 
windows.  Most  animals  have  eyes  that  see  in  the  visible  specifically  because  there  is  an  atmospheric 
window  in  the  visible. 

To  have  a thorough  understanding  of  the  effects  of  atmospheric  propagation  on  sensor 
performance  we  need  to  study  the  absorption  lines  in  the  microwave  region  and  the  infrared  region 
in  more  detail.  Let  us  begin  in  the  microwave.  One  of  the  best  sources  of  attenuation  information 
is  a model  developed  by  Crane  [15].  Microwave  attenuation  data  was  collected  by  analysis  of 
satellite  communication  signal-to-noise  ratios.  Crane  analyzed  the  data  to  develop  a semi-empirical 
model  for  predicting  microwave  attenuation.  Crane’s  coefficients  were  determined  by  regression 
of  the  measured  propagation  data.  He  assumed  that  the  major  contributors  to  variation  in  attenuation 
would  be  frequency,  humidity,  and  temperature  (the  widths  of  the  absorption  peaks  will  be 
temperature  dependent  because  they  are  pressure -broadened  and  the  collision  frequency  depends 
on  the  average  molecular  velocity).  The  attenuation  was  assumed  to  scale  linearly  with  the 
atmospheric  pressure.  Since  sea  level  pressure  does  not  vary  more  than  a few  percent,  except  in 
hurricanes,  it  was  not  included  in  the  regression.  It  is  easy  to  scale  any  of  the  following  results  to 
pressures  other  than  1013  mbar  if  desired. 

In  the  following  we  assume  that  T=  ambient  temperature  (in  K),  h = absolute  humidity  (in 
g/m^),  v=  microwave  frequency  (GHz),  and  the  ambient  pressure  is  1013  mbar.  The  attenuation  per 
km  at  sea  level  (in  dB/km)  is  assumed  to  have  the  form 

Y[v)  = a(v)  + b(v)-h-c{v)-(T-  273.1 5)  (3.6) 

where  a{  v),  b(v),  and  c(v)  are  frequency  dependent  regression  coefficients.  The  attenuation  over 
earth-to-space  path  at  the  zenith  (in  dB)  is  assumed  to  have  the  form 

A(v,  90°)  ^a(v)  + /3(v)-h-s(v)-(T-  273.1 5)  (3.7) 

where  a(  v),  /Ay),  and  £(v)  are  the  regression  coefficient  for  this  path  type.  An  atmospheric  scale 
height  (in  km)  may  be  calculated  from 

H(v)^  A(v,90°)l r(v).  (3.8) 

Using  this  scale  height,  the  attenuation  over  a slant  path  to  space  (in  dB)  may  be  found  from  one  of 
the  following  two  relations 


A{v,e)  ^ = ^(v,90°)  csc6> 


6»>  10“  (3.9a) 


or 
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A{v,0)  = 


2A(v,90°) 

sin6>+  sin^  6>  + (^2//(v)/8500) 


6»<  10° 


(3.9b) 


It  should  be  noted  that  because  this  is  a regression  based  on  real  data,  residual  attenuations  (i.e., 
those  for  which  h=0  or  T=2  75. 7 5 AT)  will  have  considerably  larger  errors  than  attenuations  at  normal 
conditions.  This  will  be  obvious  in  the  charts  that  follow. 

Table  3-4  shows  Crane’s  regression  coefficients  at  a number  of  frequencies.  Should  the 
reader  have  the  need  to  obtain  accurate  estimates  at  frequencies  other  than  those  listed,  the  following 
interpolation  procedure  should  be  used.  Y will  denote  a coefficient  value,  and  v will  denote  the 
frequency.  First,  a slope  and  intercept  are  calculated  using  values  from  the  table. 


and 


m - log[n  / F2]  / log[  vl  / v2\ 
b - log(72)  - m ■ log(  v2) 


Slope 

Intercept 


(3.10) 

(3.11) 


Then,  the  interpolated  value  is  obtained  by  direct  substitution  into  the  relation 
log[7(v)]  = m-log(v)  -I-  b . 


(3.12) 


Cranes ’s  interpolation  technique  is  more  accurate  than  simple  linear  interpolation  because  the 
attenuation  coefficients  are  logarithmic  quantities.  Thus  a logarithmic  interpolation  is  more 
accurate. 


Figure  3-11  shows  the  calculated  attenuation  coefficients  versus  frequency  for  four  different 
humidity  conditions.  Note  that  the  /?  = 0 curve  has  small  bumps  at  frequencies  we  have  already 
associated  with  as  being  due  entirely  to  water  vapor.  At  zero  humidity  there  should  be  no  absorption 
peaks  at  these  frequencies.  The  small  bumps  that  show  up  are  artifacts  of  the  regression.  Figure  3- 
1 2 shows  the  analogous  plots  for  earth-to-space  zenith  paths.  Eqs.  (3.6)  through  (3.12)  are  routinely 
incorporated  into  computer  codes  that  are  used  to  predict  the  performance  of  microwave  sensors  and 
microwave  communication  links.  We  will  refer  back  to  these  results  when  we  begin  to  discuss 
sensors  operating  in  the  microwave  region. 

In  the  preceding  we  have  used  the  concept  of  absolute  humidity.  Absolute  humidity  is  the 
amount  of  water  vapor  (in  g/m^)  in  the  air.  At  any  temperature,  there  is  a maximum  absolute 
humidity  which  corresponds  to  complete  saturation  of  the  air  by  water  vapor.  This  absolute 
humidity  corresponding  to  air  saturated  with  water  vapor  can  be  computed  from  the  expression  [16] 
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Table  3-4.  Crane’s  regression  coefficients  vs.  frequency  for  microwave  attenuation 
over  horizontal  paths  (a,  b,  c)  and  earth-to-space  zenith  paths  (a,  P,  s).  [15] 


V tGHzl 

a(y} 

My} 

c{y} 

g(v) 

m 

My} 

1 

0.00588 

0.0000178 

0.0000517 

0.0334 

0.0000028 

0.000112 

4 

0.00802 

0.000141 

0.000085 

0.0397 

0.000276 

0.000176 

6 

0.00824 

0.0003 

0.0000895 

0.0404 

0.000651 

0.000196 

10* 

0.00878 

0.000919 

0.000103 

0.0427 

0.00210 

0.000278 

12 

0.00898 

0.00137 

0.000108 

0.0436 

0.00318 

0.000315 

15 

0.00953 

0.00269 

0.000125 

0.0461 

0.00634 

0.000455 

16 

0.00976 

0.00345 

0.000133 

0.0472 

0.00821 

0.000536 

20 

0.0125 

0.0125 

0.000101 

0.056 

0.0346 

0.00155 

22 

0.181 

0.0221 

0.000129 

0.076 

0.0783 

0.0031 

24 

0.0162 

0.0203 

0.0000563 

0.0691 

0.0591 

0.0025 

30 

0.0179 

0.01 

0.00028 

0.085 

0.0237 

0.00133 

35 

0.0264 

0.0101 

0.000369 

0.123 

0.0237 

0.00149 

41 

0.0499 

0.0121 

0.00062 

0.237 

0.0284 

0.00211 

45 

0.0892 

0.014 

0.00102 

0.426 

0.0328 

0.00299 

50 

0.267 

0.0171 

0.00251 

1.27 

0.0392 

0.00572 

55 

3.93 

0.022 

0.0158 

24.5 

0.049 

-0.00121 

60* 

1.797 

0.0252 

0.00999 

10.17 

0.0566 

0.0027 

70 

0.449 

0.0319 

0.00443 

2.14 

0.0732 

0.0104 

80 

0.16 

0.0391 

0.0013 

0.705 

0.0959 

0.00586 

90 

0.113 

0.0495 

0.000744 

0.458 

0.122 

0.00574 

94 

0.106 

0.054 

0.000641 

0.417 

0.133 

0.00594 

100* 

0.110 

0.0614 

0.000642 

0.422 

0.151 

0.00663 

110 

0.116 

0.0749 

0.000644 

0.431 

0.185 

0.00785 

115 

0.206 

0.0826 

0.00185 

0.893 

0.203 

0.0113 

120 

0.985 

0.0931 

0.0115 

5.35 

0.221 

0.0363 

140 

0.123 

0.129 

0.000372 
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Figure  3-11.  Microwave  attenuation  coefficient  versus  frequency  for  horizontal  sea  level  paths. 


Figure  3-12.  Microwave  attenuation  coefficient  versus  frequency  for  surface-to-space  zenith  path. 
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and  is  shown  in  Figure  3-13.  If  the  air  has  a known  amount  of  absolute  humidity,  this  same  eurve 
may  be  used  to  find  the  dewpoint.  The  dewpoint  is  the  temperature  at  which  cooling  air  becomes 
saturated  with  a fixed  absolute  amount  of  water  vapor.  If  the  temperature  drops  below  the  dewpoint, 
some  of  the  water  must  condense  out  as  fog  or  dew.  The  highest  recorded  dewpoint  is  roughly  34 
“C.  This  corresponds  to  37  g/m^  of  absolute  humidity.  If  a problem  specification  requires  outdoors 
operation  in  absolute  humidities  greater  than  37  g/m^,  then  the  problem  specification  is  suspect. 

Relative  humidity  is  a term  commonly  used  in  news  hour  weather  reports.  It  is  defined  as 

Relative  Humidity  = Absolute  Humidity /Saturation  Humidity  x 100%  (3.14) 

For  example,  if  the  temperature  is  30  “C,  then  the  saturation  humidity  is  30  g/m^  If  at  the  same  time 
the  absolute  humidity  is  20  g/m^,  then  the  relative  humidity  is  20/30  x 1 00%  = 67%.  If  the  dewpoint 
is  10  °C,  then  the  absolute  humidity  is  9.5  g/m^  If  the  actual  temperature  is  20  °C,  then  the 
saturation  humidity  is  approximately  17  g/m^  The  relative  humidity  is  thus  9.5/17  x 100%  = 56%. 

The  attenuation  coefficient  in  the  infrared  is  seldom  evaluated  analytically  or  even  semi- 
empirically.  Its  calculation  is  usually  performed  using  one  of  a few  computer  programs  that  have 
been  developed  over  the  course  of  the  last  several  decades.  LOWTRAN  is  the  most  widely  known. 
[3]-[ll]  It  was  developed  by  the  then  Air  Force  Geophysics  Laboratory  for  low  resolution 
(>20  cm  ')  atmospheric  transmission  and  atmospheric  radiance  analyses.  It  uses  band  model 
calculations  for  frequencies  (wavelengths)  from 

0 < Av  < 50,000 cm'^ 

0.2  jum  < A < GO 

0<  V < 1.499x10'' Hz 


LOWTRAN  will  provide  outputs  for  frequencies  in  the  microwave  region.  However,  the  poor 
resolution  (20  cm  ')  makes  all  data  below  600  GHz  or  above  500  pm  to  have  questionable  validity. 
LOWTRAN  includes:  - 6 user-selectable  atmospheric  climate  models  with  33-layer  atmospheres 

(see  Table  3-3) 

- Numerous  user-selectable  aerosol  models 

- Rain,  fog,  and  cloud  models 

- Multiple  scattering  solar  irradiance  contributions 

- AFGL  (0-120  km)  atmospheric  profiles 


61 


Figure  3-13.  Absolute  humidity  of  saturated  air  as  a function  of  temperature.  [16] 


-15  -10  -5  0 5 10  15  20  25  30  35  40  45  50 

TEMPERATURE  or  DEWPOINT  Celsius 


- User-specified  path  geometries  (inch  horizontal,  slant,  surface-to-space,  etc.) 

- Atmospheric  refraction 

- User-definition  of  atmospheric  or  aerosol  models  is  permitted. 
LOWTRAN  is  written  in  FORTRAN.  The  latest  version  is  L0WTRAN7  which  was  released  in 
Spring  1 989.  A PC-based  executable  version  is  available  from  Ontar  Corporation  in  Brookline,  MA 
as  is  the  original  FORTRAN  source  code. [1 1]  The  Air  Force  selected  Ontar  to  be  the  long-term 
support  contractor  for  this  code.  The  source  code  contains  a wealth  of  hidden  information 
(incorporated  as  block  data  files)  on  the  atmosphere  and  the  various  models  used  in  the  calculations. 
This  data  extracted  from  the  L0WTRAN7  source  code  was  used  to  create  many  of  the  charts  in  this 
Chapter. 

LOWTRAN  has  now  been  replaced  by  M0DTRAN.[17]  MODTRAN  is  a moderate 
resolution  version  of  LOWTRAN.  It  uses  almost  all  of  the  same  LOWTRAN  models  but  permits 
calculations  to  1 cm  ' resolution.  MODTRAN  outputs  for  frequencies  below  30  GHz  and 
wavelengths  longer  than  1 cm  are  still  suspect.  Several  versions  of  MODTRAN  are  also  available 
from  Ontar. [18] 

HITRAN  [19]  was  the  high  resolution  companion  to  LOWTRAN.  It  used  a completely 
different  scheme  (rigorous  line-by-line  addition  of  spectral  contributions  as  opposed  to  a simpler 
absorption  band  model)  for  computing  molecular  absorption.  This  permits  calculations  to  any 
resolution  desired  by  the  user.  Otherwise  it  used  most  of  the  same  models  as  LOWTRAN  (such  as 
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the  path  models  and  models  for  aerosols,  fogs,  elouds,  rain,  and  refraction).  HITRAN  is  extremely 
useful  for  calculating  transmission  of  laser  outputs.  HITRAN  was  replaced  a number  of  years  ago 
by  FASCODE  [20]  which  uses  the  same  scheme  as  HITRAN  but  implements  a much  faster 
computational  algorithm.  FASCODE  and  HITRAN  both  are  available  from  Ontar.[21],[22] 

There  are  other  atmospheric  transmission  codes,  such  as  the  Army’s  EOSAEL  family. [23] 
However,  the  author  has  never  used  them  and  cannot  report  their  characteristics  with  any  certainty. 

Let  us  now  examine  the  visible  and  infrared  portions  of  the  spectrum.  Figure  3-14  shows 
the  attenuation  in  the  spectral  regions  from  0.4  to  2.0  pm  and  from  2.0  to  20.0  pm.  The  calculations 
were  performed  using  LOWTRAN7  for  the  specific  inputs  shown  in  the  figure  caption.  Note  that 
the  wavelength  scale  is  different  for  the  two  plots.  Transmittance  is  the  fraction  of  the  incident 
radiation  that  is  not  absorbed  or  scattered.  It  is  directly  related  to  the  extinction  coefficient  a 
through  the  relation 

T = (3.15) 


Figure  3-14.  LOWTRAN7  calculation  of  the  transmittance  of  the  atmosphere  in  the  infrared. 
The  calculation  assumes  the  mid-latitude  winter  climate  model,  with  a rural 
aerosol  model  with  23  km  visibility,  and  a 1 km  horizontal  path  at  sea  level.[l  1] 


WAVELENGTH  (MICROMETER) 


63 


where  R (range)  is  the  distance  traveled.  There  is  obviously  considerable  structure.  In  fact,  in 
several  wavelength  regions  there  are  two,  three,  or  even  four  different  species  making  significant 
contributions  to  the  extinction. 

A better  understanding  of  the  qualitative  character  can  be  obtained  by  using  the  same 
LOWTRAN  7 output  to  identify  the  contributions  from  each  species.  This  is  shown  in  figures  3-15 
and  3- 1 6.  Figure  3-15  covers  the  visible  and  near  infrared  spectrum.  There  are  a few  isolated  peaks 
associated  with  the  uniformly-mixed  gases.  The  one  sharp  line  at  roughly  0.78  pm  is  due  to  oxygen; 
the  remaining  peaks  are  due  to  carbon  dioxide.  Water  vapor  is  seen  to  be  the  major  contributor  to 
the  extinction  structure  with  a number  of  regularly  spaced  absorption  bands.  Each  band  contains 
a myriad  of  narrower  absorption  lines.  The  water  vapor  continuum  absorption  is  seen  to  be  a 
significant  contributor  although  not  as  important  as  the  absorption  from  isolated  water  molecules. 
Molecular  (Rayleigh)  scattering  is  seen  to  be  weak  contributor,  showing  up  only  as  having  a 
significant  effect  below  0.5  pm.  Aerosol  scattering  is  much  stronger,  contributing  a smooth 
background  extinction  throughout  the  visible  and  near  infrared. 

The  mid  infrared  (Figure  3-16)  presents  an  even  more  complicated  picture.  There  are  strong 
absorptions  from  the  uniformly-mixed  gases.  In  this  instance  the  absorber  is  almost  entirely  CO2. 
Water  vapor  and  the  water  vapor  continuum  provided  strong  absorption  at  a number  of  bands  in  this 
region.  There  is  a limited  aerosol  extinction  virtually  everywhere.  Please  note  that  the  visibility  in 
Figure  3-16  was  chosen  as  5 km;  in  Figure  3-15  it  was  23  km.  The  choice  of  lower  visibility  makes 
the  aerosol  contribution  more  pronounced  without  affecting  anything  else.  Ozone  and  nitrogen 
continuum  absorption  contribute  small  amounts  of  absorption  in  two  places.  A brief  summary  of 
the  character  of  infrared  molecular  absorption  could  read:  no  significant  molecular  absorption  in  the 
visible;  broad  but  well-spaced  water  vapor  absorption  bands  dominate  the  near  infrared;  carbon 
dioxide  and  water  vapor  vie  for  strongest  absorber  throughout  most  of  the  mid  infrared  with  only 
a few  atmospheric  windows  being  relatively  free  of  absorption. 
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Aerosol  Scattering 

Figures  3-15  and  3-16  each  have  one  graph  that  shows  the  effects  of  aerosol  scattering  on 
the  visible  and  infrared  transmission  through  the  atmosphere.  The  aerosols  addressed  by  these 
graphs  are  the  so-called  “haze”  aerosols.  These  aerosols  are  formed  naturally  from  condensation 
of  atmospheric  water  vapor  around  tiny  particles  of  soot,  salt,  organic  condensates,  or  dust  (of  soil, 
meteoritic,  or  volcanic  origin).  Depending  on  the  origin  of  an  aerosol,  it  can  have  vastly  different 
attenuation  characteristics.  Figure  3-17  shows  the  relative  extinction  coefficient  as  a function  of 
wavelength  for  the  four  major  tropospheric  aerosol  models  used  by  LOWTRAN.  Before  examining 
the  differences  between  the  aerosols  it  is  important  to  understand  the  meaning  of  the  term  “relative” 
in  our  description.  Note  that  all  four  aerosols  have  a relative  extinction  of  “1.0"  at  0.55  pm.  This 
wavelength  is  the  wavelength  at  which  meteorological  range  (visibility)  is  defined.  The  extinction 
at  any  other  wavelength  is  relative  to  the  extinction  at  0.55  pm.  Given  a declaration  of  visibility 
from  some  source,  the  extinction  coefficient  at  0.55  pm  can  be  determined  fromEq.  (3.1).  This  value 
then  defines  the  absolute  extinction  coefficient  at  0.55  pm.  Multiplying  any  relative  extinction 
coefficient  at  wavelength  A (from  the  curves)  by  the  absolute  extinction  coefficient  at  0.55  pm  gives 
the  absolute  extinction  coefficient  at  wavelength  A.  This  necessity  to  know  the  visibility  in  order 
to  predict  the  aerosol  attenuation  at  other  wavelengths  is  one  of  the  peculiarities  of  LOWTRAN. 
It  is  also  one  region  that  visibility  remains  a critical  meteorological  parameter  in  estimating  sensor 
performance. 


Figure  3-17.  Relative  extinction  coefficient  of  tropospheric  aerosols.[l  1] 
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Figure  3-18.  Correlation  of  visibility  and  atmospheric  extinction  at  10.6  pm.[24] 


Figure  3-18  shows  a correlation  of  visibility  with  measured  atmospheric  extinction  at  10.6 
pm.  For  visibilities  below  3 km,  the  curve  is  a virtually  perfect  fit  to 

Extinction  = Constant  / Visibility.  (3.16) 

At  visibilities  above  3 km,  the  attenuation  becomes  more  and  more  dominated  by  molecular 
extinction.  This  explains  the  slight  hook  at  one  end  of  the  plot.  Correlations  like  this  have 
reinforced  time  and  again  the  ability  to  use  visibility  to  predict  extinction  at  other  wavelengths. 

Examination  of  the  curves  in  Figure  3-17  indicates  that  depending  on  the  source  of  the 
aerosol,  there  can  be  more  than  a factor  of  ten  difference  in  extinction  in  the  3-5  pm  region  of  the 
infrared,  although  a factor  of  five  is  more  characteristic  in  the  8-12  pm  region.  Figure  3-19  shows 
the  relative  absorption  coefficient  of  the  tropospheric  aerosols.  Relative  in  the  sense  of  Figure  3-19 
refers  to  the  fraction  of  the  total  extinction  produced  by  absorption.  The  extinction  coefficient  is  the 
sum  of  the  absorption  and  scattering  coefficients 

^ ~ ^ ABsi'^)  ^ SCAli'^)  (3-17) 
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If  A(X)  is  the  relative  extinction  coefficient  at  wavelength  A,  and  B(X)  is  the  relative  absorption 
coefficient,  then 


and 


cCext{^)  - ct£;fr(0.55//m) 

^ ABS  (Z)  = B(X) 

OC  ext  (X). 


(3.18) 

(3.19) 


Examination  of  Figure  3-19  shows  that  the  relative  absorption  is  small  for  all  of  the  tropospheric 
aerosols.  That  is,  the  tropospheric  aerosols  have  considerably  more  scattering  than  absorption.  This 
is  especially  true  in  the  3-5  and  8-12  pm  bands. 

Since  most  tropospheric  aerosols  have  a significant  water  content,  it  might  be  expected  that 
relative  humidity  would  strongly  affect  the  extinction.  Figure  3-20  shows  the  relative  extinction 
coefficient  of  maritime  aerosols  (nominally  salt  water  aerosols)  as  a function  of  relative  humidity, 
while  Figure  3-21  shows  the  relative  absorption  for  those  same  aerosols  under  the  same  conditions. 
As  expected,  relative  humidity  does  have  a significant  impact  on  both  the  extinction  coefficient  and 
absorption  of  the  maritime  aerosol.  In  general,  higher  humidity  leads  to  higher  extinctions  and 
higher  absorption  in  the  infrared.  Similar  humidity  effects  are  observed  with  the  other  aerosols. 


Figure  3-19.  Relative  absorption  coefficient  of  tropospheric  aerosols.  [1 1] 
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Figure  3-20.  Relative  extinction  of  maritime  aerosols  as  a function  of  relative  humidity.  [1 1] 
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. Relative  absorption  of  maritime  aerosols  as  a function  of  relative  humidity.[l  1] 
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Hydrometeors  - Fogs  and  Clouds 

As  defined  earlier,  hydrometeors  are  “water  in  the  air”,  that  is,  rain,  fog,  clouds,  snow,  etc. 
In  an  earlier  section  we  saw  how  important  the  absolute  humidity  was  in  defining  the  molecular 
absorption  contribution  to  the  total  extinction.  Just  as  important,  however,  is  the  fraction  of  that 
humidity  that  is  in  the  form  of  liquid  water.  Figure  3-23  compares  the  attenuation  of  liquid  water 
with  that  of  water  vapor  as  a function  of  total  water  content  (humidity).  As  is  evident,  the 
attenuation  of  the  liquid  is  four  orders  of  magnitude  greater  than  the  vapor.  This  should  not  come 
as  a complete  surprise.  Remember  how  important  the  water  vapor  continuum  was  to  absorption  in 
the  infrared.  That  extinction,  which  was  comparable  in  magnitude  to  the  direct  vapor  absorption  was 
caused  by  a relatively  small  number  of  water  molecules  in  collision  with  each  other.  In  liquid  water, 
every  molecule  is  in  continual  collision  with  its  neighbors.  The  continuum  absorption  dominates 
all  other  effects.  It  does  so  to  such  a great  deal  that  liquid  water  is  a strong  absorber  from  the  yellow 


Figure  3-22.  Comparison  of  the  attenuation  produced  by  different 
concentrations  of  liquid  water  and  water  vapor. 


71 


region  of  the  visible  to  the  VLF  region  of  the  radio  speetrum.  The  upshot  of  this  is  that  if  there  is 
only  0.01  g/m^  of  liquid  water  in  the  air,  its  extinction  will  be  comparable  to  that  of  30  g/m^  of  water 
vapor.  Furthermore,  clouds  and  fogs  can  have  liquid  water  content  exceeding  1 g/m^.  Thus, 
hydrometeors  can  be  the  dominant  source  of  weather-related  attenuation  in  some  portions  of  the 
electromagnetic  spectrum. 

There  are  two  basic  classes  of  hydrometeors:  the  small  particles  that  remain  suspended  in 
the  air  for  minutes  to  hours  and  the  large  particles  that  fall  to  the  surface  within  seconds  to  minutes. 
Figure  3-23  shows  the  terminal  velocity  of  water  droplets  as  a function  of  their  size.[25]  Note  that 
droplets  with  radii  greater  than  100  pm  have  terminal  velocities  greater  than  1 m/s.  Unless  driven 
upwards  by  strong  winds,  these  large  droplets  will  fall  the  few  kilometers  to  the  ground  in  a few  tens 
of  minutes  or  less.  They  make  up  rain  and  drizzle.  Particles  less  than  10  pm  have  terminal 
velocities  less  than  1 cm/s.  These  small  particles  will  take  minutes  to  settle  a few  meters.  Their 
motion  will  be  dominated  more  by  wind  motion  (even  mild  breezes)  than  to  their  downward  settling. 
These  particles  make  up  the  majority  of  fogs  and  clouds.  The  particles  in  the  intermediate  zone, 
those  tens  of  pm  in  diameter,  will  settle  out  of  low-lying  hydrometeor  formations  in  a few  minutes. 
They  form  the  “mist”  component  of  fogs  and  are  the  particles  which  rapidly  coalesce  into  larger 
raindrops  that  will  fall  rapidly  out  of  the  clouds.  Snow  particles  do  follow  the  same  curves.  Almost 
all  falling  snow  flakes  will  have  velocities  of  1-2  m/s.  Graupel  velocities  can  approach  2-3  m/s. 


Figure  3-23.  Terminal  velocity  versus  droplet  diameter  for  water  droplets.  [25] 
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It  should  be  noted  that  the  theoretical  terminal  velocity  of  a spherical  particle  scales  as  the 
square  root  of  the  material  density.  Thus,  the  curve  of  Figure  3-23  can  be  used  to  describe  the 
“fallout”  of  “spherical”  particles  other  than  water  by  multiplying  the  terminal  velocity  by  the  square 
root  of  the  density.  For  example,  dust  particles  have  densities  between  1.5  and  2.6  g/m^  Such 
particles  should  fall  between  1.2X  and  1.6X  faster  than  raindrops  of  the  same  size. 

The  generic  extinction  behavior  of  the  large  water  droplets  and  small  water  droplets  is  shown 
in  Figure  3-24.  Both  exhibit  behavior  characteristic  of  Mie  scattering.  The  small  particles  have 
maximum  extinction  in  the  visible  and  infrared.  The  extinction  then  decreases  rapidly  becoming 
negligible  in  the  microwave  region  (below  10  GHz).  Just  like  the  aerosol  extinction  described  in 
the  preceding  section,  the  small  particle  extinction  will  scale  inversely  with  the  visibility.  The  large 
particles  have  more  or  less  constant  extinction  from  the  visible  to  the  millimeter- wave  region.  The 
extinction  then  falls  off  rapidly,  becoming  negligible  below  in  the  UHF  and  VHF  regions.  The  total 
large  particle  extinction  scales  with  the  total  large  particle  concentration  (essentially  with 
precipitation  rate). 

We  address  the  small  suspended  aerosols  in  the  current  section.  Large  hydrometeors  or 
precipitation  are  covered  in  the  following  section.  In  the  visible  and  infrared  regions,  hydrometeor 
aerosols  are  treated  like  any  other  aerosol.  Figures  3-25  and  3-26  show  the  relative  extinction  and 
relative  absorption  of  two  different  fog  aerosols.  Radiation  fogs  are  formed  when  the  emission  of 


Figure  3-24.  Summary  of  hydrometeor  effects  on  electromagnetic  radiation.  [14] 
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Figure  3-25.  Relative  extinction  coefficient  of  two  different  generic  fog  aerosols.  [1 1] 
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Figure  3-26.  Relative  absorption  of  two  generic  fog  aerosols.[l  1] 
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thermal  radiation  from  an  airmass  causes  it  to  cool  below  its  dewpoint.  When  the  air  has  cooled 
sufficiently,  some  trigger  (the  presence  of  nucleation  particles,  turbulence,  cosmic  rays  - this  is  what 
causes  tracks  to  form  in  a Wilson  cloud  chamber,  etc.)  causes  the  water  in  the  supersaturated  air  to 
condense  into  thousands  of  tiny  droplets  (fog).  Radiation  fogs  evolve  with  time.  The  initial 
concentration  of  water  is  fixed  by  the  amount  that  was  present  in  the  supersaturated  air.  There  is 
little  or  no  horizontal  transport  when  conditions  are  optimal  for  radiation  fog  formation.  As  time 
goes  by,  collision  between  the  tiny  droplets  cause  them  to  coalesce  into  larger  and  larger  droplets, 
until  they  grow  large  enough  to  settle  out  of  the  air.  Because  there  is  no  replacement  of  the  small 
droplets,  the  average  particle  size  in  an  evolving  radiation  fog  grows  with  time  while  the  average 
density  of  particles  decreases  with  time.  The  “Tule  fogs”  that  regularly  menace  traffic  in  the  Central 
Valley  of  California  are  classic  examples  of  radiation  fogs. 

Advection  fogs  form  when  warmer,  moister  air  flows  over  a colder  surface.  Advection  fogs 
tend  to  evolve  spatially  rather  than  over  time.  Small  particles  are  continually  being  formed  as  new 
masses  of  warm,  moist  air  contact  the  cold  surface.  As  the  fog  drifts,  the  particles  coalesce  as  in  a 
radiation  fog.  Greater  degrees  of  evolution  correspond  to  greater  distances  from  the  “generation” 
zone.  The  sea  fogs  that  regularly  form  off  the  California  coast  and  frequently  roll  in  over  the  hills 
around  San  Francisco  Bay  are  classic  examples  of  advection  fogs. 

Other  types  of  fog  include  steam  fogs,  warm-front  fogs,  and  up-slope  fogs.  Steam  fogs 
result  when  cold  air  moves  over  warm  water.  The  high  evaporation  rate  from  the  warm  water 
supersaturates  the  cold  air  and  wisps  of  fog  will  form.  If  the  body  of  water  is  large  enough,  the 
wisps  may  coalesce  into  a dense  fog.  Sea  smoke  is  a form  of  steam  fog.  Warm-front  fog  occurs 
at  such  fronts  when  warm  rain  falls  through  colder  air  beneath  it.  Under  certain  conditions,  the 
evaporation  of  this  rain  can  produce  a supersaturated  mass  of  air  can  be  formed  at  ground  level. 
Such  fogs  can  be  extremely  dense.  Up-slope  fogs  occur  when  warm,  moist  air  is  forced  by  wind 
up  the  slopes  of  mountains.  As  the  air  adiabatically  cools,  it  can  become  supersaturated  and  a fog 
will  form.  Such  fogs  can  be  extremely  dense. 

Both  radiation  fogs  and  advection  fogs  tend  to  be  relative  thin  in  terms  of  height.  Some  fogs 
may  be  only  a few  meters  thick.  It  is  not  unknown  for  “steam  fogs”  on  lakes  to  be  so  dense  that  a 
person  in  a rowboat  can  barely  see  the  front  of  his  boat,  but  so  thin  that  if  that  person  stands  up,  his 
head  is  above  the  fog  layer.  Steam  fogs  seldom  exceed  10  m thickness. [25]  Valley  fogs  (a  form  of 
radiation  fog)  are  often  thin  enough  that  only  the  valley  bottoms  are  in  the  fog;  hills  fifty  meters 
higher  in  altitude  may  be  above  the  fog.  Fifteen  to  one  hundred  meters  is  a reasonable  estimate  for 
the  thickness  (height)  of  any  fog  layer,  although  thicknesses  as  high  as  200-300  meters  are  not 
unknown. 

The  World  Meteorological  Organization  has  recognized  ten  basic  cloud  types  or  genera.  The 
descriptions  of  these  ten  genera  are  drawn  from  their  International  Cloud  Atlas  [26],  [25].  The  cloud 
types  and  their  shorthand  notations  (two  letters  in  parentheses)  are: 

(Ci)  Cirrus  - Detached  clouds  in  the  form  of  white,  delicate  filaments  or  white  or  mostly  white 

patches  or  narrow  bands.  These  clouds  have  a fibrous  (hair-like)  appearance,  or  a silky 

sheen,  or  both. 

(Cc)  Cirrocumulus  - Thin,  white  patch,  sheet,  or  layer  of  cloud  without  shading,  composed  of 
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very  small  elements  in  the  form  of  grains,  ripples,  etc.,  merged  or  separate,  and  more  or  less 
regularly  arranged;  most  of  the  elements  have  an  apparent  width  of  less  than  one  degree. 
(Cs)  Cirrostratus  - Transparent,  whitish  cloud  veil  of  fibrous  (hair-like)  or  smooth  appearance, 
totally  or  partly  covering  the  sky,  and  generally  producing  halo  phenomena. 

(Ac)  Altocumulus  - White  or  grey,  or  both  white  and  grey,  patch,  sheet  or  layer  of  cloud, 
generally  with  shading,  composed  of  laminae,  rounded  masses,  rolls,  etc.,  which  are 
sometimes  partly  fibrous  or  diffuse,  and  which  may  or  may  not  be  merged;  most  of  the 
regularly  arranged  small  elements  usually  have  an  apparent  width  of  between  one  and  five 
degrees. 

(As)  Altostratus  - Greyish  or  bluish  cloud  sheet  or  layer  of  striated,  fibrous  or  uniform 
appearance,  totally  or  partly  covering  the  sky,  and  having  parts  thin  enough  to  reveal  the  sun 
at  least  vaguely,  as  through  ground  glass.  Altostratus  does  not  show  halo  phenomena. 
(Ns)  Nimbostratus  - Grey  cloud  layer,  often  dark,  the  appearance  of  which  is  rendered  diffuse 
by  more  or  less  continually  falling  rain  or  snow  which  in  most  cases  reaches  the  ground.  It 
is  thick  enough  throughout  to  blot  out  the  sun.  Low,  ragged  clouds  frequently  occur  below 
the  layer  with  which  they  may  or  may  not  merge. 

(Sc)  Stratocumulus  - Grey  or  whitish,  or  both  grey  and  whitish,  patch,  sheet  or  layer  of  cloud 
which  almost  always  has  dark  parts,  composed  of  tessellations,  rounded  masses,  rolls,  etc., 
which  are  non-fibrous  (except  for  virga  - precipitation  trails)  and  may  or  may  not  be  merged; 
most  of  the  regularly  arranged  small  elements  have  an  apparent  width  of  more  than  five 
degrees. 

(St)  Stratus  - Generally  grey  cloud  layer  with  a fairly  uniform  base,  which  may  give  drizzle,  ice 

prisms  or  snow  grains.  When  the  sun  is  visible  through  the  cloud  its  outline  is  clearly 
discernible.  Stratus  does  not  produce  halo  phenomena  (except  possibly  at  very  low 
temperatures).  Sometimes  stratus  appears  in  the  form  of  ragged  patches. 

(Cu)  Cumulus  - Detached  clouds,  generally  dense  and  with  sharp  outlines,  developing  vertically 
in  the  form  of  rising  mounds,  domes,  or  towers,  of  which  the  bulging  upper  part  often 
resembles  a cauliflower.  The  sunlit  parts  of  these  clouds  are  mostly  brilliant  white;  their 
bases  are  relatively  dark  and  nearly  horizontal.  Sometimes  cumulus  is  ragged. 

(Cb)  Cumulonimbus  - Heavy  and  dense  cloud,  with  a considerable  vertical  extent,  in  the  form 
of  a mountain  or  huge  towers.  At  least  part  of  its  upper  portion  is  usually  smooth,  or  fibrous, 
or  striated,  and  nearly  always  flattened;  this  part  often  spreads  out  in  the  shape  of  an  anvil 
or  vast  plume.  Under  the  base  of  this  cloud,  which  is  often  very  dark,  there  are  frequently 
low  ragged  clouds  either  merged  with  it  or  not,  and  precipitation,  sometimes  in  the  form  of 
virga. 

Figure  3-27  provides  a rough  illustration  of  each  of  these  cloud  types  along  with  the  altitudes  at 
which  they  are  commonly  encountered. 

The  characteristics  of  the  ten  cloud  types  are  given  in  Table  3-5.  Summarized  here  are 
typical  base  heights  (the  height  of  the  base  of  the  cloud  above  the  ground),  cloud  thicknesses, 
temperatures  at  the  base  of  the  cloud,  constituents  (the  physical  form  of  water  comprising  the  cloud), 
and  associated  precipitation  (if  any).  The  ten  cloud  types  are  commonly  aggregated  into  three  or 
four  classes  based  on  their  location  in  the  atmosphere.  The  three  basic  classes  are  low  (cloud  base 
below  2 km  altitude),  medium  (cloud  base  between  2 and  7 km),  and  high  (cloud  base  between  5 and 
1 3 km).  Low  clouds  are  sometimes  further  distinguished  between  those  with  and  without  extensive 
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vertical  development  (vertical  structure  and  extended  thickness).  A quick  examination  of  the  list 
also  reveals  that  clouds  can  also  be  classified  by  general  shape:  stratiform  (layered),  cumuliform 
(heaped),  or  cirriform  (fibrous). 

To  permit  prediction  of  attenuation  in  the  visible  and  the  infrared,  several  cloud  models  have 
been  developed  for  use  with  LOWTRAN  and  its  related  codes.  These  include  cumulus,  altostratus, 
stratus,  stratocumulus,  and  nimbostratus  cloud  types.  Models  have  been  developed  for  two  kinds 
of  cirrus  (small  ice  crystals  averaging  4 pm  in  length  and  large  ice  crystals  averaging  64  pm  in 
length)  as  well.  For  the  five  low  to  medium  cloud  types,  the  liquid  water  content  (the  weight  of 
water  in  liquid  form  per  unit  volume  of  cloud)  is  shown  graphically  in  Figure  3-28  as  a function  of 
altitude  and  in  tabular  form  in  Table  3-6.  Table  3-7  summarizes  the  characteristics  of  the  particles 
that  make  up  all  of  the  LOWTRAN  hydrometeor  models.  Each  “cloud”  is  assumed  to  have  a 
number  density  of  particles  versus  particle  size  that  behaves  as 
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with  the  total  number  density  of  particles  being  obtained  by  integration  of  the  expression  above,  and 


Figure  3-27.  The  ten  basic  cloud  types. 
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Table  3-5.  Characteristics  of  the  ten  cloud  genera. 


CLOUD 

GENUS 

(Type) 

CLOUD 

CONSTITUENTS 

BASE 

HEIGHT 

(kml 

CLOUD 

THICKNESS 

(kml 

BASE 

TEMP. 

m 

TYPE  OF 
PRECIPITATION 

Ci 

(High) 

Ice  Crystals 

7 - 10 

0.1 -0.2  thin 
to  1 thick 

-20  to  -60 

None 

Cc 

(High) 

Ice  Crystals 

6 - 8 

0.2  - 0.4 

-20  to  -60 

None 

Cs 

(High) 

Ice  Crystals 

6 - 8 

0.1  to 

-20  to  -60 

None 

Several 

Ac 

(Middle) 

Water  Droplets, 
Ice  Crystals,  or 
Snowflakes 

2 - 6 

0.1  - 1.0 

-l- 1 0 to  -30 

Rare,  Light 
Snow  or  Rain 

As 

(Middle) 

Ice  Crystals 
(thin  cloud) 
Water  Droplets 
(thick  cloud) 

3 - 5 

0.1  thin, 
6-8  thick 

-MO  to  -30 

Light  Rain, 
Light  to 
Medium  Snow 

Sc 

(Low) 

Water  Droplets 
(7-8  Microns) 

0.5  - 1.5 

0.2  - 0.8 

-1- 1 5 to  -5 

Usually  none; 
Occasional  Lt. 
Rain  or  Snow 

St 

(Low) 

Water  Droplets 
(4  Microns) 

0.1  - 0.7 

0.2  - 0.8 

-1-20  to  -5 

Drizzle,  Light 
Rain  or  Snow 

Ns 

(Vert. 

Devel.) 

Ice  Crystals, 
Water  Droplets 
(1 0 Microns) 

0.1  - 1.0 

1 to 
Several 

-1-10  to  -5 

Continuous 
Rain  or  Snow, 
Light  to  Heavy 

Cu 

(Vert. 

Devel.) 

Water  Droplets 
(6  Microns) 

0.8  - 1.5 

0.15  - 1.5 

-Hi 5 to  -5 

Occasional 
Light  Rain  or 
Snow 

Cb 

(Vert. 

Devel.) 

Water  Drops,  Ice, 
Ice  Crystals,  Hail 

0.4  - 1.0 

3 - 15 

-Hi 5 to  -5 

Moderate  to 
Heavy  Rain, 
Snow,  or  Hail 
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Figure  3-28.  Liquid  water  content  vs.  altitude  of  the  LOWTRAN  cloud  models.[l  1] 
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Table  3-6.  Characteristics  of  the  LOWTRAN  cloud  and  rain  models.[l  1] 


LAYER 

HEIGHT 

(km) 

CLOUD  MODELS 

CLOUD  AND  RAIN  MODELS 

LIQUID  WATER  CONTENT  (g/m3) 

RAIN  RATE  (mm/hr) 

MODEL  1 

MODEL  2 

MODEL  3 

MODEL  4 

MODEL  5 

MODEL  6 

MODEL  7 

MODEL  9 

MODEL  10 

CUMULUS 

ALTO- 

STRATUS 

STRATUS 

ST  RATO - 
CUMULUS 

NIMBO- 

STRATUS 

DRIZZLE 

FROM 

STRATUS 

LIQHT 

RAIN 

FROM 

NIMBO- 

STRATU8 

MODERATE 

RAIN 

FROM 

NIMBO- 

8TRATUS 

HEWY 

RAIN 

FROM 

CUMULUS 

EXTREME 

RAIN 

FROM 

CUMULUS 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 

2.00 

12.5 

25.0 

0.16 

0.00 

0.00 

0.00 

0.00 

0.30 

1.78 

10.5 

21.5 

70.0 

0.33 

0.00 

0.00 

0.15 

0.00 

0.65 

1.43 

8.0 

17.5 

65.0 

0.66 

0.20 

0.00 

0.30 

0.10 

0.40 

1.22 

6.0 

12.0 

60.0 

1.00 

0.35 

0.00 

0.15 

0.15 

0.00 

0.86 

0.8 

2.5 

7.5 

4 5.0 

1.50 

1.00 

0.00 

0.00 

0.15 

0.00 

0.22 

0.2 

0.8 

4.2 

20.0 

2.00 

1.00 

0.00 

0.00 

0.10 

0.00 

0.00 

0.0 

0.2 

2.5 

12.5 

2.40 

1.00 

0.30 

0.00 

0.00 

0.00 

0.00 

0.0 

0.0 

1.0 

7.0 

2.70 

0.30 

0.40 

0.00 

0.00 

0.00 

0.00 

0.0 

0.0 

3.5 

3.00 

0.15 

0.30 

0.00 

0.00 

0.00 

0.00 

0.0 

1.0 

3.50 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 

0.0 

0.2 

4.00 

0.00 

0.00 

0.00 

0.00 

0.00 

0.00 

0.0 

0.0 

0.0 
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Table  3-7.  Characteristics  of  the  LOWTRAN  hydrometeor  particle  distributions.  [1 1], [27] 


HYDRO-  SIZE  DISTRIBUTION  PARAMETERS 


METEOR 

TYPE 

(y  = 

a 

1 for  all  distributions) 
a b 

HEAVY  FOG, 
ADVECTION 

3 

0.27 

0.3 

MED.  FOG, 
RADIATION 

6 

607.5 

3.0 

CUMULUS 

3 

2.604 

0.5 

STRATUS 

2 

27.0 

0.6 

STRATO- 

CUMULUS 

2 

52.734 

0.75 

ALTO- 

CUMULUS 

5 

6.268 

1.111 

NIMBO- 

STRATUS 

2 

7.678 

0.425 

CIRRUS 

6 

2.21x10'^ 

0.09375 

THIN 

CIRRUS 

6 

0.011865 

1.5 

NUMBER  LIQUID  HjO  MODAL  RADII  0.55|am 
DENSITY  CONTENT  (No.)  (Mass)  EXTINCT. 
N W Rn  Rm  k 


(cm'^) 

(q-m-^) 

(urn) 

(urn) 

(kmh 

20 

0.37 

10.0 

20.0 

28.74 

20 

0.02 

2.0 

3.0 

8.672 

250 

1.00 

6.0 

12.0 

130.8 

250 

0.29 

3.33 

8.33 

55.18 

250 

0.15 

2.67 

6.67 

35.65 

400 

0.41 

4.5 

7.2 

91.04 

200 

0.65 

4.7 

11.76 

87.08 

0.025 

0.06405 

64.0 

96.0 

1.011 

0.5 

3.128x10'^ 

4.0 

6.0 

0.0831 

is  therefore, 

N = 

In  the  visible  and  infrared  regions  of  the  spectrum,  the  extinction  of  clouds  is  handled  very  similar 
to  that  of  fogs.  Figures  3-28  and  3-29  show  the  relative  extinction  coefficient  and  relative 
absorption  of  the  five  low  to  medium  cloud  types.  The  extinction  coefficient  at  0.55  pm  of  the  cloud 
at  maximum  liquid  water  content  (LWC)  is  given  in  the  last  column  of  Figure  3-7.  The  0.55  pm 
extinction  coefficient  is  scaled  by  the  LWC  as  a function  of  position  along  the  propagation  path 
(Figure  3-27).  It  is  further  scaled  to  accommodate  other  wavelengths  using  Figure  3-28.  Figures 
3-30  and  3-31  show  the  relative  extinction  coefficient  and  relative  absorption  of  the  two  cirrus 
models.  Attenuation  due  to  cirrus  uses  the  0.55  pm  extinction  coefficient  multiplied  by  the  cirrus 
thickness  (user  selectable)  and  further  modified  by  the  relative  extinction  data  of  Figure  3-30  to 
accommodate  other  wavelengths. 


(Y  -f  1 
I r ) 


(3.21) 
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Figure  3-29.  Relative  extinetion  eoeffieient  for  five  eloud  aerosols. [1 1] 
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Figure  3-30.  Relative  absorption  for  five  eloud  aerosols.  [1 1] 
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Figure  3-31.  Relative  extinetion  eoeffieient  for  two  eirms  aerosols. [1 1] 
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Figure  3-32.  Relative  absorption  for  two  eirms  aerosols.[l  1] 
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In  the  microwave  region,  the  attenuation  from  the  small  hydrometeors  can  be  estimated  from 
the  complex  refractive  index.  The  refractivity  N of  air  is  related  to  the  refractive  index  n by 

A^  = («-l)xlO^  ppm  (3.22) 

The  refractive  index  of  air  is  so  close  to  one  that  large  relative  changes  do  not  show  up  explicitly. 
The  refractivity  is  a large  number  in  which  even  small  changes  in  refractive  index  can  be  observed 
directly.  Many  atmospheric  constituents  contribute  directly  to  the  refractivity.  The  contribution 
from  liquid  water,  when  the  water  droplets  are  small  enough  to  be  treated  in  the  Rayleigh 
approximation,  is  given  by  [28] 


a 


w 


0.182  V 


= 4.50  w 


[s' + 2)  +6"'^ 


ppm 


(3.23) 


where  vis  the  microwave  frequency,  or^is  the  absorption  coefficient  (in  dB/km),  s’  and  s”  are  the 
real  and  imaginary  components  of  the  dielectric  constant.  Using  Debye  theory,  the  dielectric 
constant  component  of  water  were  determined  to  be  [29] 


and 


e'(v) 

s"[v) 


(72.12  + 103.3(^-1))^  (1.97)  ^ ^ 

(\^(v!vpf)  (i  + (k/Kj)’) 

(72.12  + 103.3(0-  l))(r/  v,)  (\.91)(v I v,) 

[\  + (v!Vpf)  (l  + (r/rs)') 


where  the  primary  and  secondary  relaxation  frequencies  are 

Vp  =20.09 -142(6>-l)  + 294(6'-l)"  GHz 

and 

v^  =590-1 500(6»-l)  GHz 

and  6>is  a reduced  temperature  given  by 


^=300/r 


(3.24) 


(3.25) 


(3.26) 

(3.27) 


(3.28) 


The  variants  of  these  equations  used  in  [28]  have  been  used  to  determine  the  attenuation  coefficient 
of  hydrosols  (fog  or  cloud)  for  a liquid  water  content  (LWC)  w=  \ g/m^  These  are  shown  in  Table 
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Table  3-8.  Hydrosol  (haze,  fog,  cloud)  attenuation  for  a mass  concentration  of  w = 1 g/m^ 

ATTENUATION  COEFFICIENT  fdB/km) 
TEMPERATURE  FREQUENCY  (GHz) 


T 

0 

1 

10 

30 

100 

200 

300 

0 

1.0983 

0.001 

0.097 

0.82 

5.4 

9.3 

10.7 

25 

1 .0062 

0.001 

0.051 

0.45 

4.2 

10.8 

15.3 

3-8.  The  attenuation  at  other  water  contents  scales  linearly.  If  ^ is  a coefficient  from  the  table,  then 
the  LWC-dependent  coefficient  is 

A(w)=A-w.  (3.29) 

In  Table  3-9,  the  LWC  data  from  Table  3-7  has  been  combined  with  the  data  in  Table  3-8, 
to  estimate  the  maximum  attenuation  coefficient  provided  by  each  of  the  LO  WTRAN  fog  and  cloud 
models.  From  the  data  it  appears  that  fog  and  clouds  are  negligible  contributors  to  attenuation  at  1 
GHz,  marginal  (<  1 dB  maximum  path  attenuation  for  reasonable  paths)  contributors  at  1 0 GHz,  but 
serious  attenuation  sources  at  30  GHz  and  above. 


Table  3-9.  Maximum  attenuation  coefficients  for  the  LOWTRAN  hydrometeor  models. 


HYDRO-  TEMP.  MAX.  MAX.  ATTENUATION  COEFFICIENT  (dB/km) 

METEOR  T LWC  FREQUENCY  (GHz) 


TYPE 

(°C) 

(g-m®) 

1 

10 

30 

100 

200 

300 

HEAVY  FOG, 

0 

0.37 

0.00037 

0.0359 

0.303 

2.00 

3.44 

3.96 

ADVECTION 

25 

0.00037 

0.0189 

0.167 

1.55 

4.00 

5.66 

MED.  FOG, 

0 

0.02 

2x1 0-® 

0.00194 

0.0162 

0.108 

0.186 

0.214 

RADIATION 

25 

2x10'® 

0.00102 

0.009 

0.084 

0.216 

0.326 

CUMULUS 

0 

1.00 

0.001 

0.097 

0.82 

5.4 

9.3 

10.7 

25 

0.001 

0.051 

0.45 

4.2 

10.8 

15.3 

STRATUS 

0 

0.29 

0.00029 

0.0281 

0.238 

1.57 

2.70 

3.10 

25 

0.00029 

0.0148 

0.131 

1.22 

3.13 

4.44 

STRATO- 

0 

0.15 

0.00015 

0.0146 

0.123 

0.810 

1.40 

1.61 

CUMULUS 

25 

0.00015 

0.0076 

0.068 

0.630 

1.62 

2.30 

ALTO- 

0 

0.41 

0.00041 

0.0398 

0.336 

2.21 

3.81 

4.39 

CUMULUS 

25 

0.00041 

0.0209 

0.185 

1.72 

4.43 

6.27 

NIMBO- 

0 

0.65 

0.00065 

0.0631 

0.533 

3.51 

4.65 

6.96 

STRATUS 

25 

0.00065 

0.0332 

0.293 

2.73 

7.02 

9.95 
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Hydrometeors  - Precipitation 


Rain  attenuation  can  be  determined  from  Mie  scattering  theory  if  the  raindrop  size 
distribution  is  known.  Most  measurements  of  the  size  distribution  have  attempted  to  fit  the  results 
to  a curve  of  the  form 

dN  -K(R]D 

^niD)  = N.e  ^ > (3.30) 

dD  ^ ^ 

where  is  the  number  density  scale  factor,  A(i?)  is  an  exponent  dependent  on  the  rain  rate  R (in 
mm/hr),  and  D is  the  drop  diameter  (in  mm).  From  Mie  theory  the  extinction  coefficient  a can  be 
written  as  [9] 


a = J” dD  Qext[^D  / ;i,  m(X)]  ^ (3.31) 

where  is  the  Mie  Extinction  Efficiency  and  m{X)  is  the  complex  refractive  index  of  water.  For 
raindrops  in  the  visible  and  infrared  spectral  regions,  D » A.  Therefore,  Q^xt  ~ 2,  independent  of 
wavelength  or  particle  diameter.  Thus,  we  have 

OjuM  = f e-^  = ;rV„A-^  (3.32) 

LOWTRAN  uses  the  Marshall-Palmer  raindrop  distribution 

= (3.33) 

which  leads  to  the  following  expression  for  the  extinction  due  to  rain 

^RAm  - km'^  (3.34) 

or 

dB  / km . (3.35) 

In  the  microwave  spectral  region,  this  simplified  analysis  does  not  suffice.  However,  more 
sophisticated  calculations  have  yielded  tabular  coefficients  (Table  3-10)  for  calculating  the 
attenuation  of  rain  in  the  microwave  as  a function  of  frequency  vand  polarization  (subscript  /?  = v 
for  vertical,  h for  horizontal,  and  c for  circular)  using  an  expression  of  the  form 

= dB/km.  (3.36) 
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Table  3-10.  Coefficients  for  rain  attenuation  calculations  for 
horizontal,  vertical,  and  circular-polarization. [30], [31] 


V (GHz) 

Op 

a„ 

a. 

_bh_ 

_bv_ 

b. 

1 

0.000039 

0.000035 

0.000037 

0.912 

0.880 

0.896758 

2 

0.000154 

0.000138 

0.000146 

0.963 

0.923 

0.944096 

4 

0.00065 

0.000591 

0.000621 

1.121 

1.075 

1 .099093 

6 

0.00175 

0.00155 

0.00165 

1.308 

1.265 

1.287803 

7 

0.00301 

0.00265 

0.00283 

1.332 

1.312 

1.322636 

8 

0.00454 

0.00395 

0.004245 

1.327 

1.310 

1.319091 

10 

0.0101 

0.00887 

0.009485 

1.276 

1.264 

1.270389 

12 

0.0188 

0.0168 

0.0178 

1.217 

1.200 

1.208978 

15 

0.0367 

0.0335 

0.0351 

1.154 

1.128 

1.141593 

20 

0.0751 

0.0691 

0.0721 

1.099 

1.065 

1.082707 

25 

0.124 

0.113 

0.1185 

1.061 

1.030 

1.046219 

30 

0.187 

0.167 

0.177 

1.021 

1.000 

1.011093 

35 

0.263 

0.233 

0.248 

0.979 

0.963 

0.971484 

40 

0.350 

0.310 

0.330 

0.939 

0.929 

0.934303 

45 

0.442 

0.393 

0.4175 

0.903 

0.897 

0.900176 

50 

0.536 

0.479 

0.5075 

0.873 

0.868 

0.870640 

60 

0.707 

0.642 

0.6745 

0.826 

0.824 

0.825048 

70 

0.851 

0.784 

0.8175 

0.793 

0.793 

0.793000 

80 

0.975 

0.906 

0.9405 

0.769 

0.769 

0.769000 

90 

1.060 

0.999 

1.0295 

0.753 

0.754 

0.753485 

100 

1.120 

1.060 

1.090 

0.743 

0.744 

0.743486 

120 

1.180 

1.130 

1.155 

0.731 

0.732 

0.731489 

150 

1.310 

1.270 

1.290 

0.710 

0.711 

0.710492 

200 

1.450 

1.420 

1.435 

0.689 

0.690 

0.689495 

300 

1.360 

1.350 

1.355 

0.688 

0.689 

0.688498 

400 

1.320 

1.310 

1.315 

0.683 

0.684 

0.683498 

The  preceding  expressions  are  applicable  to  determining  the  attenuation  due  to  rain.  They 
are  not  applicable  to  snow.  There  are  many  fewer  measurements  of  snow  attenuation,  and  due  to 
the  non-spherical  character  of  the  particles,  still  fewer  theoretical  analyses.  Let  us  consider  some 
of  the  available  results.  If  the  visibility  V through  the  snow  is  known,  then  the  extinction  (scattering 
- the  extinction  of  snow  will  be  mostly  scattering)  coefficient  can  be  determined  from  the  relation 
[32] 


« = 3.912 /F  km  ^ . (3.37) 

This  relation  can  be  used  with  some  reservations  in  the  visible  and  infrared  spectral  regions  (because 
the  Mie  scattering  efficiency  should  be  in  the  constant  “optical”  region  and  should  be  independent 
of  wavelength).  This  relation  can  also  be  used  for  rain  attenuation. 

If  the  visibility  is  not  known,  we  must  rely  on  a few  measurements.  Figure  3-33  shows  the 
data  obtained  at  10.6  pm  during  one  snowfall  in  the  Soviet  Union.[33] 
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Figure  3-33.  Attenuation  at  10.6  )o,m  wavelength  as  a function  of  snowfall  intensity. [3 3] 


The  Soviet  authors  also  measured  attenuation  at  0.63  |am.  They  obtained  the  following  fits  to  their 


data 

A 0.6 

dB/km 

(3.38) 

and 

^0.63 

dB/km 

(3.39) 

Despite  the  statements  made  in  the  preceding  paragraph  there  appears  to  be  some  wavelength 
dependence  to  the  attenuation.  Given  the  complexity  of  the  shapes  of  snowflakes,  there  is  no  reason 
that  the  attenuation  in  the  visible  and  infrared  must  be  independent  of  wavelength.  Nevertheless, 
the  differences  between  attenuations  at  different  wavelengths  are  small  compared  to  the  attenuation 
differences  between  snow  and  rain.  The  author  fit  an  equation  of  the  Marshall-Palmer  i?®  form  to 
the  10.6  pm  Soviet  data  and  obtained 

f^SNOw  = km-'  (3.40) 

and 

'^swoif  = 16*"“  dB/km.  (3.41) 

The  author’s  fit  gives  a result  for  snow  that  is  almost  exactly  a factor  of  ten  higher  extinction 
coefficient  than  that  for  rain. 
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A Defense  Department  handbook  [32]  on  atmospheric  attenuation  give  several  data  points 
for  snow  attenuation  in  the  millimeter-wave  region  of  the  spectrum.  Table  3-11  presents  coefficients 
of  the  equation 


^ SNOW  ~ (3-42) 

for  three  kinds  of  snow:  dry  snow  (powder  snow),  moist  snow  (small  flakes  that  form  firm  snowballs 
when  compacted),  and  wet  snow  (large  “gloppy”  particles  that  readily  stick  together). 


Table  3-11.  Millimeter-wave  snow  extinction  parameters. [32] 


SNOW 

TYPE 

35  GHz 
c d 

94  GHz 
c 

d 

DRY 

0.0125 

1.60 

0.08 

1.26 

MOIST 

0.160 

0.95 

0.31 

0.75 

WET 

0.235 

0.95 

0.61 

0.75 

This  limited  data  gives  snowfall  rate  dependences  which  are  considerably  different  than  those  in  the 
visible  and  infrared,  and  although  not  considerably  different  from  rain.  Additional  snow  data  can 
be  found  in  Reference  [34].  They  do  little  to  clarify  the  overall  picture.  About  the  only  things  that 
can  be  said  about  attenuation  due  to  snow,  is  (1)  it  will  be  significant  from  the  millimeter- wave 
region  to  the  visible  region  of  the  spectrum,  and  (2)  snow  attenuation  will  be  higher  than  rain 
attenuation  for  the  same  precipitated  equivalent  liquid  water  content. 
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Statistics  of  Weather  Effects 


Weather  is  a random  variable.  We  are  unable  to  accurately  predict  the  weather  in  most 
locations  more  than  a few  days  into  the  future.  However,  using  statistics  of  past  weather  conditions, 
we  can  estimate  with  reasonable  accuracy  the  fraction  of  the  time  that  given  conditions  will  occur. 
This  statistical  data  is  of  even  more  importance  to  system  designers  than  point  data.  For  example, 
if  we  know  that  rain  greater  than  5 mm/hr  will  degrade  the  performance  of  a sensor  design,  it  is 
useful  to  know  what  fraction  of  the  time  in  the  locales  of  interest  that  the  rain  rate  will  exceed  5 
mm/hr.  If  the  number  is  very  large  we  will  have  a problem.  If  it  is  very  small,  the  degradation  may 
be  deemed  insignificant. 

A number  of  groups  at  various  times  have  for  their  own  reasons  collected  and  analyzed 
weather  statistics.  Occasionally,  this  data  is  made  available  to  the  community  as  a whole.  One  such 
set  of  statistical  data  was  published  by  Dynetics,  Inc.  for  the  Army  Materiel  Command,  Smart 
Weapons  Management  Office,  in  the  form  of  cumulative  probability  curves  on  a wall  chart. [3 5] 
These  curves  are  reproduced  here  as  Figures  (3-34)  through  (3-37).  Cumulative  probability  curves 
for  visibility,  humidity,  rain  rate,  and  cloud  base  height  are  presented  for  three  nominal  geographic 
environments:  European  Lowlands,  Mideast  Deserts,  and  Central  America.  A cumulative 

probability  curve  is  a plot  of  the  ^"Probability  that  Variable  X has  a value  less  than  or  equal  to  x” 
as  a function  of  “x”.  When  X is  at  its  minimum  value,  P(X^in)  = 0.  When  X is  at  its  maximum 
value,  P(X^j,,;)  = 1.  When  X is  at  its  median  value,  P(X^^^ij,„)  = 0.5. 

Cumulative  probability  curves  are  often  used  in  the  following  manner.  A probability  value 
that  is  acceptable  from  a system  perspective  is  selected.  For  example,  it  may  be  acceptable  for  a 
system  to  suffer  degradation  from  excessive  humidity  a maximum  of  10%  of  the  time  in  Central 
America.  That  is,  the  humidity  must  be  below  some  as  yet  unknown  maximum  acceptable  value  for 
90%  of  the  time.  Picking  an  0.9  cumulative  probability  and  finding  the  curve  for  humidity  in 
Central  America  (Figure  3-35,  we  find  a humidity  value  of  slightly  less  than  21  g/m^.  This  value 
now  become  a derived  system  specification.  That  is,  the  system  must  work  without  degradation  at 
a value  of  21  g/m^  If  it  can  do  that  then  it  will  work  at  least  90%  of  the  time  in  Central  America 
(barring  other  variables  that  also  limit  performance). 

In  another  example,  it  was  desired  that  a sensor  operate  under  at  least  95%  of  all  visibility 
conditions.  This  requires  that  the  cumulative  probability  of  having  the  an  unacceptable  visibility 
can  only  be  5%.  Looking  at  Figure  3-34  we  find  that  a 7 km  visibility  is  acceptable  in  Central 
America  and  the  Mideast,  but  in  the  European  Lowlands,  the  5%  cumulative  probability  value 
corresponds  to  only  1.5  km.  The  systems  engineer  is  left  with  a decision  as  to  whether  a rather 
benign  7 km  visibility  requirement  (i.e.,  the  sensor  must  perform  adequately  when  the  visibility  is 
as  low  as  7 km)  or  a rather  severe  1 .5  km  visibility  requirement  should  be  imposed.  The  resolution 
of  this  problem  would  depend  on  the  exact  nature  of  the  mission  need. 

The  four  variables  in  the  Dynetics  data  set  are  generally  very  useful,  but  they  are  by  no 
means  complete.  Data  is  desirable  for  other  geographic  locations  (such  as  Southeast  Asia,  the 
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Figure  3-34.  Cumulative  probability  for  visibility  in  three  geographic  locations. [3 5] 


Figure  3-35.  Cumulative  probability  for  absolute  humidity  in  three  geographic  locations. [35] 
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Korean  Peninsula,  or  the  Antaretie)  and  for  other  meteorological  variables  (including  wind  speed, 
ambient  pressure,  air  temperature,  percent  cloud  cover,  etc). 

A relatively  complete  database  was  developed  by  Dr.  Ken  Hepfer  of  the  Dahlgren  Division 
of  the  Naval  Surface  Warfare  Center  (NSWC)  in  June  1993  for  the  Electro-Optical  Cost  and 
Operational  Effectiveness  Analysis  (COEA).[36]  The  observations  were  “randomly”  selected  from 
thousands  of  weather  ship  measurements  using  a filter  that  guaranteed  that  the  database  included  an 
equal  number  of  observations  (96  each)  from  the  Baltic  Sea,  the  Yellow  Sea,  the  Gulf  of  Oman,  and 
the  Caribbean  Sea  (25%  North  latitudes,  50%  mid-latitudes,  and  25%  tropics)  equally  divided 
between  the  months  of  the  year  (8  observations  per  month  per  location).  Each  observation  contains 
all  of  the  weather  data  collected  by  each  site.  This  includes  air  temperature,  sea  temperature, 
pressure,  relative  humidity,  absolute  humidity,  visibility,  wind  speed,  wind  direction,  rain  rate,  cloud 
cover,  and  turbulence  strength.  This  “R384"  database  has  two  main  uses  in  sensor  system  analysis. 
In  one  application,  a maritime  sensor-plus-atmospheric  model  can  be  exercised  using  each  of  the  3 84 
weather  observations.  The  384  performance  predictions  provide  the  basis  for  a “world- wide” 
operational  utility  analysis  (see  Figure  3-38),  that  is,  an  analysis  which  precisely  defines  what 
fraction  of  the  time  during  a year,  a given  season,  or  given  time  of  day,  a stated  or  desired  level  of 
performance  can  be  obtained.  Such  analyses  can  serve  as  a means  to  fairly  compare  sensors  with 


Figure  3-38.  Elements  of  an  operational  utility  analysis. 
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possibly  significantly  different  design  parameters,  design  philosophies,  or  principle  of  operation. 

In  the  second  application,  the  384  observations  can  be  input  to  a standard  model  such  as 
LOWTRAN  to  determine  values  for  atmospheric  transmission  in  a few  explicit  wavebands.  NSWC 
performed  such  calculations  for  several  infrared  wavebands  and  several  millimeter-wave  wave- 
lengths. The  results  were  then  statistically  analyzed  to  develop  cumulative  probability  of  attenuation 
curves.  For  the  infrared  bands,  the  attenuation  was  modeled  as 

T = (3.43) 

where  T is  the  transmission  over  a range  R (in  km),  a is  an  “attenuation  factor”  and  y^is  a correction 
factor  that  account  for  non-uniform  attenuation  across  a waveband.  For  a very  narrow  waveband, 
would  reduce  to  1,  and  or  would  become  the  extinction  coefficient.  Table  3-12  shows  the  two 
attenuation  coefficients  a and  for  the  3-5  pm  and  8-12  pm  bands  at  a number  of  selected  percentile 
points  of  the  cumulative  probability  distributions.  For  example,  if  one  wanted  to  design  a 3-5pm 
sensor  to  work  in  at  least  80%  of  world-wide  maritime  weather  conditions,  then  from  the  80- 
percentile  numbers,  one  should  design  the  sensor  to  work  with  a = 0.75743  and  p = 0.57004. 
Alternatively,  if  an  8-12  pm  sensor  was  capable  of  operating  in  weather  with  a = 0.72255  and  P= 
0.92409,  then  that  sensor  should  be  capable  of  operating  at  least  95%  of  the  time  anywhere  in  the 
world’s  maritime  environment. 


Table  3-12.  R384  Attenuation  coefficients  for  the  3-5  pm  and  8-12  pm  bands 


at  various  percentiles. [36] 

3-5  pm  8-12  pm 


PERCENTILE 

a 

^ r 

fi 

a 

_ r 

fi 

5 

0.45039 

0.58359 

0.12151 

0.84706 

10 

0.48351 

0.58219 

0.14316 

0.85943 

25 

0.56137 

0.57989 

0.21926 

0.88006 

50 

0.67340 

0.57156 

0.41487 

0.88614 

70 

0.72825 

0.56431 

0.52054 

0.88492 

80 

0.75743 

0.57004 

0.57288 

0.88366 

85 

0.77748 

0.57151 

0.59310 

0.88554 

90 

0.81578 

0.58742 

0.63084 

0.88640 

95 

0.98757 

0.69721 

0.72255 

0.92409 
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In  the  mid-1970's,  Kleiman  and  Modica  [37]  at  M.I.T.  Lincoln  Laboratory  used  data  from 
the  Rand  Weather  Data  Bank  [38]  to  perform  a similar  probabilistic  analysis.  In  1974,  this  database 
had  weather  reports  from  83  locations  around  the  world,  with  an  average  of  12  years  of  continuous 
reporting  from  each  site  (during  the  1950's  and  1960's),  mostly  at  hourly  or  3-hourly  intervals. 
Kleiman  and  Modica  used  all  of  the  data  from  9 of  the  83  weather  stations  in  the  archive  and 
performed  a LOWTRAN3  calculation  for  each  weather  observation.  From  these  calculations  they 
were  able  to  estimate  the  cumulative  extinction  probability  curves  for  several  different  sensor 
systems  at  different  times  of  the  year  and  undertaking  different  assumption.  The  calculated  data  for 
Berlin  during  the  summer  and  winter  seasons  at  10.6  pm  wavelength  are  shown  in  Figure  3-39. 
Please  note  that  LOWTRAN  has  been  modified  many  times  since  1975  to  considerably  improve  its 
accuracy.  The  data  in  Figure  3-39  should  be  assumed  to  display  trends  and  relative  values,  but  are 
not  quantitatively  accurate. 

There  are  several  kinds  of  information  summarized  in  these  curves.  The  solid  curve  is  the 
overall  “all-weather”  cumulative  probability  curve.  For  example,  we  can  use  the  Berlin-Winter 
curve  to  determine  that  80%  of  time  during  the  winter,  the  extinction  was  less  than  3 dB/km.  The 
horizontal  dashed  lines  divide  100%  probability  into  three  weather  conditions:  rain/ snow,  fog/haze. 


Figure  3-39.  Cumulative  atmospheric  extinction  probability  curves  for  Berlin 
during  the  summer  and  the  winter.  [36] 
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and  clear.  The  height  of  each  region  tells  the  percentage  of  the  time  that  each  of  the  three  conditions 
occurred.  The  dashed  curves  are  the  cumulative  probabilities  conditional  on  the  stated  weather 
condition  existing.  For  example,  in  the  summer,  if  it  was  raining,  then  73%  of  the  time  the 
attenuation  was  less  than  3 dB/km  and  it  rained  12%  of  the  time.  If  it  was  foggy  or  hazy  in  the 
summer  (a  1-2%  occurrence),  then  the  attenuation  was  never  less  than  5 dB/km.  Summer  fogs  are 
certified  10  pm  sensor  killers. 

Cumulative  extinction  probability  curves  are  useful  for  determining  when  conditions  of 
“diminishing  returns”  exist.  Consider  the  summertime  curve.  A sensor  capable  of  working  in  3 
dB/km  attenuation  will  work  95%  of  the  time.  In  order  to  boost  the  performance  percentage  to  97%, 
the  sensor  must  be  designed  to  work  in  5 dB/km  attenuation.  If  the  nominal  operating  range  is  10 
km,  then  the  difference  between  3 dB/km  and  5 dB/km  is  20  dB  for  a passive  sensor  and  40  dB  for 
an  active  sensor.  A sensor  designed  to  handle  20-40  dB  higher  attenuation  will  of  necessity  be 
bigger,  consume  more  power,  weigh  more,  and  cost  considerably  more.  Seldom  is  the  cost  involved 
worth  the  benefit  of  only  2%  more  operational  utility.  The  “knee”  of  the  all-weather  attenuation 
curve  occurs  at  3 dB/km.  It  is  common  that  the  knee  of  a cumulative  probability  curve  represents 
a point  of  diminishing  returns.  Sensor  systems  designers  need  to  be  aware  of  this  to  avoid  costly 
overdesign. 

Such  cumulative  probability  analyses  are  so  valuable,  that  it  is  strange  that  they  are  not  done 
more  often.  When  Kleiman  and  Modica  did  their  study  in  1975,  the  calculations  required  hundreds, 
if  not  thousands  of  hours  of  mainframe  time.  Today,  the  same  calculations  could  be  done  in  a few 
hours  on  a laptop  computer.  The  computational  resources  are  now  ubiquitous,  the  codes  are  still 
available,  and  the  weather  data  is  available  from  one  source  or  another.  This  author  recommends 
that  this  type  of  probabilistic  analysis  be  performed  on  every  new  sensor  design. 

There  are  many  sources  of  cumulative  probability  data  that  can  be  used  by  the  sensor 
designer  and  the  systems  engineer  to  evaluate  performance.  Hundreds  of  measurement  programs 
have  been  conducted  by  government  agencies,  universities,  and  private  companies  to  facilitate 
analysis  of  their  own  sensor  developments.  Unfortunately,  it  is  difficult  to  find  all  of  this 
information.  Some  of  it  is  published  as  adjunct  technical  data  supporting  a system,  some  of  it  is 
published  in  formal  technical  reports,  much  of  it  is  published  in  short  course  notes  presented  at 
technical  society  meetings  by  the  experts  that  collected  the  data,  some  is  published  in  short  notes 
to  trade  magazines,  and  some  of  it  is  only  “published”  in  internal  memoranda  or  viewgraph  briefing 
packages  (that  sometimes  find  sizable  external  distribution).  Most  successful  sensor  design 
engineers  and  systems  engineers  actively  seek  out  and  acquire  a substantial  amount  of  this  data 
during  their  careers.  New  entrants  into  these  professions  would  do  well  to  acquire  as  much  of  this 
data  as  is  possible  from  the  older  professionals  they  work  with.  It  is  often  these  compilations  that 
let  the  “expert”  professionals  generate  the  quantity  of  creative  syntheses  and  analyses  that  earned 
them  the  title  of  expert  in  the  first  place.  The  author  cannot  reproduce  every  such  data  set  in  his  own 
collection.  He  has  included  a few  that  seem  most  likely  to  be  of  benefit.  Figure  3-40  shows  the 
cumulative  probability  of  attenuation  in  the  8-12  pm  thermal  imaging  band  obtained  during  the 
winter  months  at  Meppen,  Germany  during  1976-77  by  Project  OPAQUE. [39]  Figure  3-41 
compares  10.6  pm  and  1.06  pm  attenuation  data  collected  by  Plessey  Radar  Ltd.  at  their  Isle  of 
Wight  facility  during  October  1974  to  May  1975. 
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Figure  3-40.  Cumulative  probability  of  8-12  pm  attenuation  obtained  during  Project  OPAQUE. 
Data  site  was  Meppen,  Germany  and  the  collection  period  was  the  Winter  of  1976-77.  [39] 


Figure  3-41.  Cumulative  probability  of  laser  line  attenuation  obtained  by  Plessey  Radar  Ltd. 
Data  site  was  the  Isle  of  Wight,  UK  and  the  collection  period  was  October  1974  to  May  1975. 


ATTENUATION  (dB/km) 
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The  attenuation  due  to  any  clouds  (except  cirrus)  is  usually  so  high  in  the  visible  and  infrared 
that  clouds  are  considered  opaque.  The  utility  of  visible  and  infrared  systems  looking  up  (or  down) 
through  the  atmosphere  depends  on  the  ability  to  find  a cloud-free  line  of  sight.  Kleiman  and 
Modica  analyzed  this  problem  and  came  up  with  a table  which  relates  the  probability  of  obtaining 
a cloud-free  line  of  sight  (CFLOS)  as  a function  of  line  of  sight  elevation  angle  and  the  meteorologi- 
cal cloud  cover.  [3  7]  Cloud  cover  is  often  reported  as  the  number  of  eighths  of  the  sky  that  is 
obscured  by  clouds.  A four  (eighths)  rating  implies  that  4/8  = 50%  of  the  sky  is  obscured.  The 
results  of  the  analysis  are  summarized  in  Table  3-13.  It  should  be  noted  that  even  at  zero  (eighths) 
- nominally  cloudless  - there  is  still  a small  possibility  of  having  a cloud  pass  through  the  line  of 
sight.  At  eight  (eighths)  - nominally  total  overcast  - there  is  a small  possibility  of  finding  a small 
hole  in  the  cloud  cover. 


Table  3-13.  Probability  of  cloud-free  line  of  sight  as  a function  of  partial  cloud  cover  (eighths) 

and  line  of  sight  elevation  angle. [37] 


ELEV. 

ANGLE 

(Peg) 

0 

1 

PROBABILITY  OF  CLOUD-FREE  LINE  OF  SIGHT 
CLOUD  COVER  (Eighths) 

2 3 4 5 6 

7 

8 

0 

0.962 

0.640 

0.490 

0.370 

0.275 

0.195 

0.095 

0.040 

0.020 

5 

0.965 

0.730 

0.600 

0.490 

0.380 

0.280 

0.180 

0.090 

0.025 

10 

0.970 

0.820 

0.695 

0.590 

0.485 

0.360 

0.250 

0.130 

0.030 

15 

0.973 

0.875 

0.750 

0.645 

0.540 

0.420 

0.300 

0.190 

0.035 

20 

0.975 

0.895 

0.805 

0.700 

0.600 

0.490 

0.365 

0.225 

0.040 

25 

0.978 

0.910 

0.825 

0.730 

0.645 

0.525 

0.400 

0.250 

0.045 

30 

0.980 

0.920 

0.850 

0.775 

0.680 

0.575 

0.430 

0.275 

0.050 

35 

0.983 

0.930 

0.860 

0.790 

0.695 

0.595 

0.460 

0.290 

0.055 

40 

0.985 

0.940 

0.875 

0.800 

0.710 

0.610 

0.480 

0.300 

0.060 

45 

0.988 

0.945 

0.880 

0.815 

0.725 

0.630 

0.500 

0.320 

0.065 

50 

0.990 

0.950 

0.890 

0.825 

0.740 

0.650 

0.510 

0.325 

0.070 

55 

0.993 

0.952 

0.895 

0.830 

0.750 

0.660 

0.520 

0.330 

0.075 

60 

0.995 

0.954 

0.898 

0.840 

0.765 

0.675 

0.525 

0.340 

0.080 

90 

1.000 

0.960 

0.900 

0.850 

0.780 

0.680 

0.545 

0.350 

0.085 
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Another  useful  set  of  statistics  is  the  cumulative  probability  of  rainfall  rate  as  a function  of 
geographic  location.  Barring  explicit  data  sets  such  as  those  in  Figure  3-34  there  are  several  other 
ways  to  obtain  rain  rate  estimates.  Figure  3-42  shows  the  worldwide  rain  climate  zones  used  at  one 
time  by  the  Consultative  Commission  on  International  Radio  of  the  ITU.  [31]  Table  3- 1 4 shows  data 
on  the  percentage  of  time  each  year  that  the  rain  rate  exceeds  specified  levels  (this  is  cumulative 
probability  data  presented  in  a slightly  different  format.  Similar  plots  and  tables  have  been 
generated  by  Crane. [40]  The  existence  of  local  microclimates  (small  regions  a few  square 
kilometers  in  size  where  terrain  and  windflow  are  much  more  or  much  less  favorable  to  precipita- 
tion) make  it  hazardous  to  rely  on  such  global  estimation  techniques.  It  should  be  noted  that  MIL- 
STD-210C  gives  data  on  climatic  extremes  (and  occasionally  percentage  exceedance)  values  for  a 
number  of  meteorological  variables  of  importance  to  system  design.  [41]  For  comparison  with  the 
data  in  Table  3-14,  MIL-STD-210C  gives: 

* highest  1-min  rainfall  = 31.2  mm  (=  1807.2  mm/hr) 

* highest  1-hr  rainfall  = 305  mm 

* highest  12-hr  rainfall  = 1350  mm  (=  1 12.5  mm/hr) 

* highest  24-hr  rainfall  = 1880  mm  (=  78.3  mm/hr) 

* highest  5-day  rainfall  = 4300  mm  (=  35.8  mm/hr). 


Figure  3-42.  CCIR  Rain  Climate  Zones. [31] 

LONGiTUDE 
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Table  3-14.  Rain  Rate  Distributions  for  the  CCIR  Rain  Climate  Zones.  [31] 


RAIN  RATE  DISTRIBUTION  VALUES  (mm/hr) 
PERCENTAGE  OF  TIME  (%)  & (TIME  PER  YEAR)  EXCEEDED 


CLIMATE 

ZONE 

1.0 

(87.7h) 

0.3 

(26.3h) 

0.1 

(8.77h) 

0.03 

(158m) 

0.01 

(52.6m) 

0.003 

(15.8m) 

0.001 

(5.26m) 

A 

- 

1 

2 

5 

8 

14 

22 

B 

1 

2 

3 

6 

12 

21 

32 

C 

- 

3 

5 

9 

15 

26 

42 

D 

3 

5 

8 

13 

19 

29 

42 

E 

1 

3 

6 

12 

22 

41 

70 

F 

2 

4 

8 

15 

28 

54 

78 

G 

- 

7 

12 

20 

30 

45 

65 

H 

- 

4 

10 

18 

32 

55 

83 

J 

- 

13 

20 

28 

35 

45 

55 

K 

2 

6 

12 

23 

42 

70 

100 

L 

- 

7 

15 

33 

60 

105 

150 

M 

4 

11 

22 

40 

63 

95 

120 

N 

5 

15 

35 

65 

95 

140 

200 

P 

12 

34 

65 

105 

145 

200 

250 
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Problems 


3- 1 . Identify  the  major  potential  contributors  to  atmospheric  extinction  and  qualitatively  describe 

the  altitude,  wavelength,  and  weather  regimes  in  which  each  is  a major  factor. 

3-2.  What  is  an  "atmospheric  window"?  Name  or  describe  by  frequency  or  wavelength  at  least 
five  major  atmospheric  windows. 

3-3.  Describe  the  relation  between  extinction,  absorption,  and  scattering. 

3-4.  Smoke  from  a nearby  fire  reduces  the  visibility  in  an  area  to  500  meters.  Without  any  other 
information  what  would  you  estimate  the  scattering  coefficient  at  550  nm  wavelength  to  be? 

3 -5 . A chemical  pollutant  causes  only  absorption  (no  scattering)  at  1 0 pm  wavelength.  The  total 

extinction  at  this  wavelength  is  estimated  to  be  1.0  km  '.  Normal  atmospheric  constituents 
are  expected  to  contribute  to  produce  a background  absorption  coefficient  of  0.5  km  ' and 
a scattering  coefficient  of  0.3  km'.  What  is  the  absorption  coefficient  of  the  pollutant?  If 
the  atmospheric  extinction  contributions  increased  two-fold  and  the  pollutant  concentration 
increased  ten-fold,  what  total  extinction  coefficient  would  result? 

3-6.  What  meteorological  parameter  defines  the  vertical  structure  of  the  atmosphere? 

3-7.  Sketch  the  behavior  of  temperature,  pressure,  humidity,  & aerosol  density  vs.  altitude.  Use 
the  same  vertical  scale  for  each  sketch. 

3-8.  A temperature  gradient  of  8 degrees  per  1000  meters  exists  in  an  air  mass.  Is  this  air  mass 
considered  stable  or  unstable? 

3-9.  What  atmospheric  constituents  are  almost  entirely  absent  at  altitudes  above  15000  meters? 
What  atmospheric  constituent  is  stronger  at  15000  meters  than  at  sea  level?  Do  not  consider 
clouds. 

3-10.  Given  an  atmosphere  containing  O2,  N2,  CO2,  H2O,  and  O3: 

a.  Which  species  produce  ultraviolet  absorption  between  200  and  300  nm? 

b.  Which  species  have  strong  absorption  lines  between  10  and  100  GHz? 

c.  Which  species  have  strong  absorption  bands  between  1 and  10  pm? 

3-11.  If  the  ambient  temperature  is  305  K,  the  absolute  humidity  is  30  g/m^,  and  the  rain  rate  is  25 
mm/hr,  estimate  the  sea  level  microwave  absorption  coefficient  at  15  GHz. 

3-12.  For  this  problem  assume  that  the  " signal- to-noise  ratio"  of  a radar  scales  as 

SNR  cc  . At  a particular  sea- level  radar  test  range  the  average  absolute  humidity 

is  7.5  g-m'^.  At  this  range  one  model  of  a 17  GHz  surface  search  radar  has  a detection  range 
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of  20km  against  a standard  target.  Ignoring  all  other  possible  phenomena  that  can  affect 
range,  plot  the  detection  range  against  the  standard  target  of  this  radar  as  a function  of 
absolute  humidity  ranging  from  0 g-m'^  (Arctic  conditions)  to  30  g-m'^  (tropical  conditions). 
Six  to  ten  well-chosen  data  points  are  sufficient  to  be  plotted. 

3-13.  If  the  dewpoint  is  293. 15  K,  estimate  the  saturation  humidity. 

3-14.  For  this  problem  assume  that  the  "signal-to-noise  ratio"  of  a radar  scales  as 

SNR  cc  . At  the  same  test  range  and  for  the  same  standard  target,  two  different 

radars  give  the  same  detection  range  of  50  km.  One  radar  operates  at  1 GHz;  the  second 
radar  operates  at  25  GHz  (near  a peak  absorption  of  water  vapor).  A third  radar  operates  at 
60  GHz  (at  the  peak  of  an  oxygen  absorption)  and  has  a detection  range  of  1 km.  Ignoring 
all  other  phenomena  that  can  affect  range,  plot  the  variation  of  detection  range  with  altitude 
over  the  interval  from  0 km  altitude  to  30  km  altitude.  Six  to  ten  well-chosen  data  points  are 
sufficient  to  be  plotted. 

3-15.  If  the  aerosol  concentrations  of  four  model  atmospheres  (that  vary  only  in  aerosol  type  and 
concentration)  are  adjusted  such  that  the  extinction  coefficients  at  10  pm  of  the  four  major 
LOWTRAN  aerosol  models  (rural,  urban,  maritime,  and  tropospheric)  are  equal,  which 
model  will  have  higher  extinction  at  1.0  pm? 

3-16.  If  the  visibility  is  10  km,  estimate  the  absorption  coefficient  of  a rural  aerosol  at  5.0  pm. 

3-17.  A millimeter-wave  radar  (at  98  GHz)  is  proposed  for  an  anti-aircraft  seeker.  At  maximum 
range  (10  km)  the  seeker  has  1 0 dB  margin  for  weather  penetration.  Will  clouds  cause  this 
seeker  to  fail? 

3-18.  Water  vapor  causes  a 5 dB/km  extinction  at  a wavelength  in  the  near  infrared.  What  rain 
rate  will  produce  an  extinction  of  equal  magnitude? 

3.19.  Using  the  data  in  Figures  3-34  through  3-37  estimate  the  rain  rate  that  should  be  used  to 
calculate  extinction  if  greater  than  95%  probability  of  operation  is  desired  in  any  geographic 
locale.  Estimate  the  visibility  that  should  be  assumed  for  the  equivalent  aerosol  extinction 
calculations. 

3-20.  You  are  responsible  for  developing  a new  low-altitude  (<  100  m max.  alt.),  subsonic,  land 
attack  cruise  missile.  The  Program  Executive  Officer  (PEO)  received  his  training  in  the  70's. 
He  demands  that  use  an  "all-weather"  altimeter  (implying  an  existing  17  GHz  radar 
altimeter).  You  know  that  this  device  can  be  detected,  localized,  identified,  and  tracked  by 
a new  generation  of  threat-owned,  netted,  radio-frequency  interferometers.  As  a result  most 
of  the  missiles  can  be  easily  downed.  In  addition  to  tactfully  explaining  the  above,  what  can 
you  say  to  the  PEO  about  the  existence  of  alternatives  and  the  true  meaning  of  "all  weather"? 
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3-21.  You  are  responsible  for  picking  the  sensors  for  an  "air  vehicle"  to  be  used  during  the  first 
manned  mission  to  Mars.  Assume  that  the  Martian  atmosphere  is  almost  completely 
composed  of  nitrogen  and  carbon  dioxide  with  no  oxygen,  ozone,  or  water  vapor,  and  that 
the  average  surface  temperature  is  21  OK. 

(1)  Which  of  the  following  sensors  are  likely  to  experience  a significant  problem  from 
atmospheric  attenuation  (ignoring  the  effects  of  Martian  aerosols): 

a)  a 60  GHz  terrain  mapping  radar? 

b)  a high-resolution  thermal  imager  operating  in  the  4. 0-4. 5 pm  wavelength  band 

and  designed  for  surveying  land  travel  routes? 

c)  a general-purpose  thermal  imager  operating  in  the  8-12  pm  band? 

d)  an  image  intensified  optical  viewer  operating  in  the  0. 9-1.0  pm  band? 

e)  a "solar-blind"  ultraviolet  sensor  operating  in  the  0.2 1 -0.29  pm  band  and  designed 

to  detect  the  low-intensity  ultraviolet  beacons  on  previous  unmanned  landers? 

(2)  Which  of  the  above  sensors  will  be  significantly  affected  by  the  Martian  dust  storms 
(assume  that  all  dust  particles  have  diameters  less  than  1 pm)?  Which  of  these  sensors  will 
be  marginally  affected? 

(3)  Do  any  of  the  above  sensors  have  serious  problems  other  than  molecular  attenuation  or 
aerosol  attenuation?  Which  sensors  have  what  problems?  Consider  things  like  signature 
limitations  or  noise  sources. 

(4)  Which  of  these  sensors  will  be  difficult  to  flight  test  here  on  Earth?  Why?  (be  specific) 

3-22.  A laser  communication  link  is  required  for  ship-to-ship  covert  communication  over  distances 
less  than  20  km.  Two  strong  candidates  present  themselves:  a GaAs  diode  laser  link 
operating  at  0.89  pm  and  a CO2  laser  link  operating  at  10.59  pm.  Considering  only  the 
effects  of  the  atmosphere  on  system  performance,  compare  the  two  links.  List  aU  atmo- 
spheric propagation  phenomena  which  are  likely  to  adversely  affect  the  two  systems  and  rate 
each  phenomenon  as  to  whether  it:  1)  negatively  affects  the  GaAs  link  much  more  than  it 
does  the  CO2  link,  2)  negatively  affects  the  GaAs  link  somewhat  more  than  the  CO2  link,  3) 
affects  both  links  equally,  4)  negatively  affects  the  CO2  link  somewhat  more  than  the  GaAs 
link,  or  5)  negatively  affects  the  CO2  link  much  more  than  it  does  the  GaAs  link.  Present 
your  results  in  the  form  of  a table.  If  the  relative  impact  of  any  phenomenon  can  be 
quantitatively  evaluated  without  use  of  computer  codes,  give  a numerical  number  for  that 
relative  impact.  If  any  phenomenon  affects  either  the  GaAs  or  CO2  links,  but  does  not  s- 
ignificantly  affect  the  other,  please  indicate  these  unilateral  impacts.  NOTE:  multipath  is 
not  a problem  for  these  two  systems. 

3-23.  A rangefinder  is  required  to  provide  accurate  tracking  of  ballistic  missiles  at  ranges  up  to 
500  km  for  an  airborne  high  energy  laser  defense  system.  Both  the  airborne  laser  and  the 
targets  will  be  at  altitudes  above  10  km.  Serious  candidates  exist  incorporating  YAG  lasers 
at  1.06  pm  and  CO2  lasers  at  10.59  pm.  Perform  a comparative  analysis  of  these  two 
candidates  considering  only  atmospheric  propagation  effects. 
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CHAPTER  4 


PROPAGATION  OF  ELECTROMAGNETIC  RADIATION.  III. 
- REFRACTIVE  EFFECTS 


Horizon  Extension 

The  atmosphere  has  a refractive  index  that  is  close  to  unity,  but  nevertheless  not  equal  to 
unity.  This  refractive  index  is  a function  of  temperature,  pressure,  and  humidity,  as  well  as 
wavelength.  As  a consequence,  there  is  considerable  spatial  and  temporal  inhomogeneity  in  the 
atmospheric  refractive  index.  This  innate  variability  allows  the  atmospheric  refractive  index  to 
produce  a number  of  interesting  and  significant  effects  on  the  propagation  of  electromagnetic 
radiation. 

One  characteristic  of  the  atmospheric  refractive  index  is  its  relatively  predictable  variation 
with  altitude.  In  the  visible  and  infrared  regions  of  the  spectrum,  the  refractive  index  of  air  is  given 
with  reasonable  accuracy  by  an  expression  due  to  Edlen  [ 1 ] . As  mentioned  in  the  preceding  chapter, 
the  refractive  index  n of  air  is  more  commonly  expressed  in  terms  of  the  refractivity  N 

V = («-l)xlO'’  (4.1) 

According  to  Edlen,  the  refractivity  of  air  may  be  determined  from  the  expression 


N = 


83.434 


185.08 

l-(0.1603//lf 


[pa-Ph,o)  29615 
1013.25  T 


4.11 

1- (0.0877 //if 


+ 


43.49 -(0.0588/;if 


Ph^o 

1013.25 


(4.2) 


where  X = wavelength,  = total  pressure  (mbar),  Pu20  = partial  pressure  of  water  vapor  (mbar), 
and  r = temperature  (K).  The  partial  pressure  of  water  vapor  may  be  calculated  from  the  relative 
humidity  and  the  temperature  using  the  expression 
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(4.3) 


6.105  exp 


^,^^r-273.15  ,,,,  r T ^ 

25.22 5.31  In  

T 1 273.15 


In  the  microwave  spectral  region,  the  refractivity  may  be  determined  from  the  expression  [2] 


^^77^_^3J3xH)^ 


(4.4) 


Microwave  propagation  specialists  often  define  a quantity  called  the  modified  refractivity  M 


M = N + 0.\51h 


(4.5) 


where  h is  the  height  above  the  ground  in  meters. 

Figure  4-1  shows  the  microwave  refractivity  as  a function  of  altitude  for  the  U.  S.  Standard 
atmosphere  along  with  the  gradient  (per  km)  of  that  refractivity.  Note  that  for  this  “standard” 
condition,  there  is  strong  negative  gradient  This  also  means  that  as  the  refractivity  decreases  with 
increasing  altitude,  the  speed  of  light  increases  with  increasing  altitude.  Remembering  Snell’s  law 


Figure  4-1.  Refractivity  and  its  gradient  (per  km)  of  the  U.  S.  Standard  atmosphere. 
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(Figure  4-2), 


sin  6y  = «2  sin  6^  = constant  for  a given  ray  path , (4.6) 


we  would  expect  upward  traveling  rays  to  bend 
away  from  the  vertical  as  they  propagated.  To 
see  this,  consider  an  atmosphere  with  a vertical 
gradient  that  has  been  divided  into  layers  of  a 
finite  thickness  5.  Each  successively  higher  layer 
has  a slightly  larger  speed  of  light  (lower  index  of 
refraction).  This  is  essentially  a finite  element 
approximation  to  the  real  atmosphere.  If  5 were 
to  become  infinitesimally  small  and  the  number 
of  layers  were  to  become  infinite,  the  model 
would  approach  reality  (which  is  continuous). 
Part  of  such  a model  is  shown  is  shown  in  Figure 
4-3.  As  a ray  moves  upward,  propagating  from 
one  layer  to  the  next,  it  undergoes  successive 
Snell’s  law  refractions  that  cause  it  to  bend  more 
and  more  away  from  vertical  propagation.  Such 
bending  will  occur  in  a continuous  atmosphere 
with  a gradient,  although  the  bending  will  take 
the  form  of  a continuous  curvature  rather  than 
discrete  bends. 

The  radius  of  curvature  is  related  to  the 
gradient.[2],[3]  That  this  is  so  can  be  easily 
demonstrated  geometrically.  Consider  the  strati- 
fied geometry  of  Figure  4-4.  For  a small  enough 
angle  5a,  the  radius  of  curvature  r can  be  deter- 
mined from 

X = r Sa  (4.7) 

where 

x = x^+x^ 

= Sh  I + Sh  I Icosa^ 

^Shlcosa  (4.8) 


Figure  4-2.  Snell’s  law  geometry. 


Figure  4-3.  Ray  curvature  in  an  atmosphere 
with  a negative  refractivity  gradient. 


Combining  these  two  results  yields  the  following  relation  for  the  radius  of  curvature 

r = Sh!  [Sa  cos«)  . (4.9) 
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Figure  4-4.  Radius  of  curvature  resulting  from  a refractive  index  gradient. 


If  we  now  differentiate  Snell’s  Law  we  obtain 


or 


Sn  sin  ct  + n cos  a da  = 0 
Sa  I Sn  = -(1/  n)tMva  . 


Combining  Eqs.  (4.9)  and  (4.11)  yields 

r = -n  Sh  / Sn  sin  a . 


For  nearly  horizontal  propagation  in  air,  n ~ 1,  and  a~  90°,  and 


r = -Sh  / Sn 


(4.10) 

(4.11) 

(4.12) 

(4.13) 


A positive  radius  means  the  center  of  curvature  lies  below  the  ray.  The  “curvature”  p of  the  ray  is 
the  inverse  of  the  radius  of  curvature. 
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(4.14) 


da  1 dn 

dx  r dh 


If  the  propagation  path  now  lies  above  a curved  earth  (curvature  = \IR^,  where  is  the 
radius  of  the  Earth)  the  curvature  of  the  ray  relative  to  the  Earth’s  surface  is 


, 1 dn 

p — 1 

R dh 


with  an  effective  radius  R 


1 _ 1 dn 
R dh 


(4.15) 


(4.16) 


Multiplying  by  the  true  Earth  radius  gives 


R " dh  k 


(4.17) 


where  k is  the  ratio  of  the  effective  radius  to  the  true  radius.  The  curvature  of  the  ray  relative  to  the 
surface  of  an  earth  with  effective  radius  R = kR^  is  zero.  The  ray  will  act  as  if  it  follows  a straight 
line  path  in  this  effective  earth.  A nominal  average  value  of  dn/dh  = -3.9x10*  m ' in  the  microwave 
region  of  the  spectrum.  [2]  Given  a true  Earth  radius  of  approximately  6.37  x 10®m,  we  find  that 
a typical  value  of  k is  roughly  4/3. 

Because  of  this,  “normal”  refraction  will  affect  the  distance  at  which  objects  near  the  horizon 
may  be  detected.  Assuming  an  effective  earth  radius  allows  refraction  to  be  neglected  and  any  ray 
to  be  treated  as  a straight  line.  Consider  a spherical  earth  of  arbitrary  radius  R,  as  shown  in  Figure 
4-5.  The  distance  L to  the  horizon  (tangent  point)  for  an  object  at  height  H above  the  surface  can 
be  calculated  from  the  expression 


(R  + H)^  =R^  + L^ 


(4.18) 


For  any  H small  compared  to  the  radius  R,  it  is  possible  to  use  the  approximate  result 


L^[H^+  2HRf^  « (iHRf^  . 


(4.19) 
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Figure  4-5.  Geometry  of  determining  the  distance  to  the  horizon. 


The  true  average  radius  of  the  Earth  is 

= 637 1.0  km  = 3440.065  nmi . (4.20) 

Thus,  the  geometric  distance  to  the  horizon  [R  = R^]  on  the  Earth  is 

L — 3.570  km  (7/ in  meters)  (4.21) 

Z = 1.064 nmi  (77  in  feet).  (4.22) 

In  the  microwave  region  of  the  electromagnetic  spectrum  we  may  assume  that  the  effective 
radius  of  the  Earth  in  the  microwave  region  is  4/3  times  the  true  Earth  radius,  as  we  previously 
calculated  using  Eq.  (4. 1 7).  This  validates  our  use  of  the  straight  line  equation  (4. 1 9).  Substituting 
R = (4I3)R^  into  Eq.  (4.19)  leads  to  the  following  simple  expressions  for  the  radar  horizon 

L — 4.122  km  (77  in  meters)  (4.23) 

L^\.229H^'^  nmi  (77  in  feet).  (4.24) 

The  increase  in  radar  horizon  given  by  these  expressions  is  reasonable  under  normal  conditions.  If 
superreffactive  or  ducting  conditions  exist  (see  below),  then  the  effective  horizon  may  exceed  these 
estimates. 
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In  the  visual  and  infrared  regions  of  the  spectrum,  there  is  also  refraction.  In  general  this 
refraction  is  not  as  strong  as  that  in  the  microwave  region.  The  visual  horizon  may  be  estimated 
using  a 7/6  Earth  radius. [4]  Substituting  R = (7/6)7?^  into  Eq.  (4.19)  leads  to  the  following  simple 


expressions  for  the  visual  horizon 

Z/ = 3.856 km  (iZ  in  meters)  (4.25) 

L = \149H^^^  nmi  (iZ  in  feet).  (4.26) 

The  easy-to-remember  fraction  7/6  (=1.1667)  is  the  nearest  simple  fraction  to  the  true  mean  value 
of  1 .1662.  Slightly  better  horizon  distances  could  be  found  from  the  equations 

L = 3.839  km  (ZZ in  meters)  (4.25A) 

L = 1.144  nmi  (ZZ  in  feet).  (4.26A) 


if  desired.  The  alternate  set  of  expressions  (Eq.  (4.25  A)  & (4.26A))  are  more  accurate  and  preferred. 

The  total  distance  to  an  object  that  appears  right  on  the  horizon  is  the  sum  of  the  distance 
from  the  object  to  the  horizon  and  the  distance  from  the  horizon  to  the  observer 

^TOTAL  = Lj  + Lq  = + ZZq  ^ ) , (4.27) 

where  A is  one  of  the  numerical  coefficients  from  the  equations  above. 

It  is  interesting  to  note  that  if  we  select  p ’ equal  to  zero,  then  the  ray  curvature  relative  to  the 
Earth’s  surface  is  zero,  a ray  initially  traveling  parallel  to  the  surface  remains  parallel  even  as  it 
travels  completely  around  the  world.  This  condition  is 

— = — ^ = -1.57x10“’ m“'  (4.28) 

dh 

or  in  terms  of  refractivity 

— = -0.157m'‘.  (4.29) 

dh 

This  is  the  origin  of  the  Q.\51h  term  in  the  conversion  between  N and  M (in  Eq.  (4.5)).  If  the 
refractivity  gradient  is  more  negative  than  this  then  every  ray  (in  an  infinitely  high  atmosphere)  has 
enough  curvature  to  ultimately  bend  back  and  intercept  the  earth.  The  rays  may  be  said  to  be 
“trapped”. 
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Ducting  and  Mirages 

As  explicitly  demonstrated  in  Eq.  (4.4),  the  refractivity  is  a function  of  pressure,  temperature, 
and  water  vapor  concentration.  Except  in  severe  storms  with  strong  vertical  winds,  total  pressure 
almost  always  falls  off  smoothly  with  increasing  altitude.  However,  temperature  anomalies  (varia- 
tions in  lapse  rate,  possibly  including  inversions)  and  vertical  variations  in  humidity  (variations  in 
gradient  due  to  excessively  low  or  high  surface  humidity  or  even  layers  of  abnormally  low  or  high 
humidity)  can  exist  in  the  atmosphere.  For  these  reasons,  the  actual  bending  of  ray  paths  need  not 
follow  the  simple  relations  presented  above  (and  corresponding  to  “normal  refraction”). [5] 

Consider  Figure  4-6.  If  the  refractive  gradient  is  less  negative  (or  even  positive)  then  the 
curvature  is  less  than  “normal”  (radius  of  curvature  is  larger  than  normal)  or  curved  in  the  opposite 
direction  (away  from  the  surface).  This  atmospheric  condition  is  called  “subrefractive”.  If  the 
refractive  gradient  is  more  negative  than  normal  then  the  curvature  is  larger  than  normal  (radius  of 
curvature  is  smaller  than  normal).  This  atmospheric  condition  is  called  “superrefractive”.  In 
extreme  conditions,  the  curvature  may  become  so  large  that  the  rays  will  bend  more  quickly  than 
the  earth  curvature.  Even  rays  initially  directed  away  from  the  surface  will  curve  until  they  strike 
the  earth’s  surface  and  reflect.  This  is  called  a “trapping”  condition,  because  the  rays  cannot 
escape. 


The  refractivity  gradients  that  correspond  to  these  conditions  are  described  in  Table  4-1. 
Trapping  occurs  when  the  gradient  in  the  modified  refractivity  becomes  negative.  “Normal” 
refraction  is  assumed  to  occur  whenever  the  modified  refractivity  is  between  79  and  157  km  '. 


Figure  4-6.  Four  modes  of  propagation  in  a refractive  atmosphere. 


SUBREFRACTION 
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Table  4-1.  Relation  of  refractivity  gradient  to  refractive  behavior. 


BEHAVIOR 

TRAPPING 

SUPERREFRACTIVE 

NORAAAL 

SUBREFRACTIVE 


N-GRADIENT  (N/kml 

< -157 
- 157  to  -79 
- 79  to  0 
> 0 


M-GRADIENT  (M/km) 
< 0 

Oto  79 
79  to  1 57 
> 157 


Trapping  is  such  an  anomalous  behavior,  when  it  occurs,  that  it  must  have  profound  impacts 
on  sensor  performance.  For  this  reason  it  will  be  examined  in  more  detail.  The  first  question  is  what 
conditions  lead  to  a negative  M-gradient?  We  will  begin  in  the  microwave  region  of  the  electro- 
magnetic spectrum.  Combining  Eqs.  (4.4)  and  (4.5)  yields  the  expression 


M = 


T 


f0.157/t 


(4.30) 


The  only  factors  in  this  expression  which  are  capable  of  very  anomalous  behavior  are  the  water 
vapor  density  and  the  temperature.  A negative  gradient  requires  either  a very  rapid  decrease  in 
water  vapor  concentration  or  a very  rapid  increase  in  temperature.  As  we  have  seen  in  Figure  4-1 
(add  157  to  the  N-gradient  shown  to  get  the  M-gradient),  neither  the  rapid  falloff  of  water  vapor  in 
the  normal  troposphere  nor  the  temperature  inversion  at  the  tropopause  are  adequate  to  make  the  M- 
gradient  negative.  First,  consider  dry  air  at  sea  level.  In  the  absence  of  water  vapor  the  refractivity 
becomes 


,.^1  78609 

M = + 0.l51h 

1/7^=1013  T 


Taking  the  derivative  with  respect  to  height  gives  the  gradient 


dM 


or 


dh 

dM 


= 0.157 


/7^=1013 mbar;  Ph20^^ 


78609  dT 


dh 


= 0.157-1.054 


p^=1013mbar;  Ph20=^'<  r=273.15K 


dh 


(4.31) 

(4.32) 

(4.33) 


where  the  last  expression  has  been  explicitly  evaluated  at  a temperature  of  273.15  K.  The  M- 
gradient  becomes  negative  if  the  temperature  gradient  exceeds 
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>0.149  K/m 


(4.34) 


dh 


/?^=1013  mbar;  p^^Q^O; 


r=273.15K 


This  is  a substantial  temperature  gradient  but  one  which  could  easily  occur  in  a situation  where 
warm  air  moved  over  a cold  surface  or  cold  air  mass. 

Now  consider  moist  air.  If  we  once  again  evaluate  the  modified  refractivity  expression  at 
a temperature  of  273.15  K,  we  find 

"l.=,.B„w;r=™.,=K  = 287.79  + 5.000;,„,„  + 0.157ft  (4.35) 


The  M-gradient  becomes 


dM 

dh 


0.157  + 5.000 


^Ph20 

dh 


(4.36) 


which  in  turn  becomes  negative  if  the  water  vapor  pressure  gradient  is  sufficiently  negative,  i.e., 


^Ph20 

dh 


<-0.0314  mbar/m. 


(4.37) 


At  a temperature  of  273 . 1 5 K,  the  water  vapor  pressure  can  be  calculated  from  the  relative  humidity 
from  the  relation 


Ph.^ 


o 


r=273.15K 


RH 


X 6.105  mbar 


(4.38) 


Given  partial  pressures  of  several  mbar,  a gradient  of  a few  hundredths  of  a mbar/meter  seems 
relatively  easy  to  achieve. 

Indeed  we  find  such  gradients  over  virtually  every  body  of  water.  The  ocean  is  the  primary 
source  of  water  in  the  overall  hydrologic  cycle  (ocean  water  evaporates;  ocean  air  moves  over  land 
causing  precipitation;  water  returns  to  ocean  through  streams  and  rivers).  The  air  in  intimate  contact 
with  the  surface  of  the  water  will  be  fully  saturated  with  water  vapor  at  the  water  temperature.  As 
the  saturated  air  moves  away  from  the  surface,  it  will  mix  with  drier  air  causing  a negative  gradient. 
The  gradient  will  diminish  with  increasing  distance  until  at  a height  of  a few  meters  to  a few  tens 
of  meter  the  negative  gradient  disappears  and  more  normal  refractivity  behavior  resumes.  The  entire 
height  region  from  the  ground  to  the  point  at  which  a positive  M-gradient  resumes  is  called  an 
evaporation  duct  (EVD).  The  height  profile  of  a typical  evaporation  duct  is 
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/"A + 0.00015^ 


(4.39) 


M{h)  = M 


SURFACE 


h 5^ 

+ In 

8 8 


V 


0.00015 


where  5 is  the  height  of  the  top  of  the  duct.  This  is  shown  schematically  in  Figure  4-7. 


Occasionally  conditions  arise  where  layers  of  moist  air  are  forced  to  move  over  a layer  of 
less  moist  air.  These  can  lead  to  the  three  other  kinds  of  ducts  as  shown  in  Figure  4-7.  If  the 
trapping  layer  is  more  extended  than  an  evaporation  duct,  but  touches  the  surface,  we  have  a surface 
duct  (SD).  The  trapping  layer  is  that  range  of  heights  where  the  M-gradient  is  negative  (i.e.,  the 
refractivity  indicates  trapping  behavior).  If  the  trapping  layer  does  not  extend  to  the  surface,  but  the 
minimum  modified  refractivity  in  the  layer  is  smaller  than  the  modified  refractivity  at  the  surface, 
we  have  a surface-based  duct  (SBD).  Although  the  trapping  layer  is  elevated,  the  duct  it  produces 
(see  below)  extends  to  the  ground.  If  the  trapping  layer  is  sufficiently  elevated,  the  minimum 
modified  refractivity  will  not  be  smaller  than  the  value  at  the  surface.  In  this  case  we  have  an 
elevated  duct  (ED).  The  duct  will  extend  from  the  top  of  the  trapping  layer  down  to  the  height  at 
which  modified  refractivity  below  the  trapping  layer  equals  the  minimum  modified  refractivity  at 
the  top  of  the  trapping  layer.  The  position  and  thickness  of  the  duct  regions  and  the  trapping  layers 
are  indicated  for  all  four  kinds  of  ducts.  As  mentioned  earlier,  evaporation  ducts  are  inevitable  over 
water.  The  other  kinds  of  ducts  are  known  to  occur  occasionally  if  not  frequently  over  land  as  well 
as  over  water,  but  data  is  limited. 


Figure  4-7.  Types  of  ducts  resulting  from  trapping  layers  in  the  microwave. 
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Statistics  on  some  ducts  in  maritime  environments  has  been  summarized  in  a portion  of  a 
computer  code  called  Engineer’s  Refractive  Effects  Prediction  System  (EREPS).[5]  A component 
of  this  code  called  SDS  produces  statistical  outputs  on  demand.  Some  of  the  available  data  is 
summarized  in  Table  4-2.  Using  the  SDS/EREPS  data  we  have  calculated  the  evaporation  duct 
heights  at  the  10  percentile,  50  percentile,  and  90  percentile  cumulative  probability  levels  for  26 
maritime  locations  around  the  world  and  for  the  world  as  a whole.  We  have  also  listed  the  average 
value  of  k,  the  refractivity  at  the  surface  the  probability  of  occurrence  of  surface-based 

ducts  (SBD),  and  the  average  SBD  duct  height  for  each  location.  Data  is  available  for  most  of  the 
ocean  surface  averaged  over  each  Marsden  square.  Each  Marsden  square  is  a 1 0 degree  x 1 0 degree 
region  of  the  ocean  surface.  Figure  4-8  shows  the  Marsden  squares  for  which  duct  data  is  available. 
Figure  4-9  shows  an  example  of  the  output.  Data  are  not  available  for  the  Arctic  and  Antarctic 
regions,  nor  are  they  available  over  land  (however,  ducts  over  the  water  tend  to  extend  over  adjacent 
land  areas  although  may  elevate  as  they  move  farther  from  the  coasts).  Data  are  not  available  for 
surface  ducts  or  elevated  ducts. 


Table  4-2.  Maritime  duct  statistics  obtained  from  the  SDS  portion  of  the  EREPS  code. 


EVP  HEIGHT  (ml  SBD 


GEOGRAPHIC 

PERCENTILES 

AVERAGE 

SBD 

AV.  HT. 

LOCATION 

10 

50 

90 

K 

bUuRFACE 

M 

(m) 

Worldwide  Average 

4 

13.1 

22 

1.45 

339 

8.0 

85 

California  Coast 

2 

8.9 

16 

1.62 

328 

12.0 
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Gulf  of  Alaska 

1 

5.3 

10 

1.38 

321 

2.0 

48 

Hawaiian  Islands 

9 

16.8 

24 

1.51 

362 

20.0 

114 

Rapa,  French  Oceania 

3 

10.6 

18 

1.44 

355 

2.0 

154 

Guam 

10 

17.7 

25 

1.71 

380 

10.0 

107 

Coral  Sea 

8 

16.2 

25 

1.48 

353 

10.0 

90 

Tasman  Sea 

5 

13.5 

22 

1.41 

329 

21.3 

78 

Yellow  Sea 

3 

11.9 

21 

1.41 

322 

14.3 

83 

Paracel  Islands 

8 

16.3 

25 

1.59 

374 

14.0 

79 

North  Australian  Basin 

8 

17.2 

26 

1.69 

342 

19.0 

147 

Gulf  of  Thailand 

7 

14.9 

22 

1.68 

381 

14.4 

83 

Bay  of  Bengal 

8 

16.0 

24 

1.77 

378 

17.5 

109 

Diego  Garcia 

8 

15.0 

22 

1.60 

382 

22.0 

80 

Persian  Gulf 

6 

14.7 

25 

2.08 

357 

45.5 

202 

Central  Mediterranean 

5 

11.9 

20 

1.54 

337 

21.8 

122 

Cape  of  Good  Hope 

5 

13.1 

22 

1.41 

334 

2.0 

90 

Ascension  Island 

10 

16.9 

24 

1.37 

322 

1.7 

69 

Mauritanian  Coast 

8 

15.2 

22 

1.78 

345 

37.0 

148 

North  Sea 

1 

6.4 

12 

1.37 

322 

1.7 

69 

Greenland-lceland  Gap 

2 

5.9 

10 

1.34 

316 

3.5 

45 

N.  Atlantic  (44N,41W) 

2 

9.7 

17 

1.41 

336 

5.0 

68 

Grand  Banks 

1 

6.6 

15 

1.34 

315 

1.7 

68 

Bermuda 

6 

14.7 

24 

1.48 

347 

2.0 

200 

Bahamas 

8 

16.4 

25 

1.59 

370 

1.0 
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Caribbean  Sea 

10 

17.6 

25 

1.60 

371 

19.0 

122 

Cape  Horn 

2 

6.4 

14 

1.31 

312 

3.5 

48 
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Figure  4-8.  Availability  of  duct  data  by  Marsden  squares. 
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Figure  4-9.  Example  of  SDS/EREPS  duct  data  output. 
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It  is  worthwhile  to  summarize  the  data  in  Table  4-2.  Evaporation  duets  oceur  all  of  the  time 
over  large  bodies  of  water.  Duet  heights  can  range  from  a low  as  one  meter  to  as  high  as  40  meters 
but  5 to  25  meters  is  most  probable  with  1 3 meters  as  an  average.  Y early  average  values  of  N at  the 
water  surface  can  range  from  roughly  310  ppm  to  over  380  ppm  depending  on  location.  Surface- 
based  ducts  occur  only  8%  of  the  time  world  wide,  but  probabilities  of  occurrence  can  be  as  low  as 
1%  and  as  high  as  45%  depending  on  location.  Hot  & humid  locations  tend  to  have  high  probabili- 
ties of  SBD  occurrence,  while  cold  regions  tend  to  have  low  probabilities  of  SBD  occurrence.  SBD 
heights  average  85  meters  worldwide  but  can  range  from  50  meters  to  over  200  meters  depending 
on  location. 

We  have  described  the  anomalous  propagation  regions  resulting  from  trapping  layers  as 
ducts  although  we  have  not  yet  justified  this  description.  Consider  a trapping  layer  that  extends  to 
the  surface  and  assume  that  the  refractivity  gradient  is  a constant  negative  value,  as  shown  in  Figure 
4-10.  We  will  use  a flat  earth  illustration,  but  curved  earth  equations  in  the  computations.  For  a 
constant  negative  gradient,  the  ray  curvature  is  constant,  curved  downwards  toward  the  surface,  and 
given  by 


1 dn 

1 

dh 


(4.40) 


relative  to  the  Earth’s  surface.  For  many  rays  this  curvature  is  sufficient  to  cause  them  to  strike  the 


Figure  4-10.  Trapping  in  a negative  linear  gradient  extending  up  from  the  surface. 
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surface,  reflect  from  that  surface,  and  repeat  the  process  of  curving  back  towards  the  surface  until 
it  strikes  the  surface  again  (and  again ...).  This  distance  traveled  before  striking  the  surface  depends 
on  the  transmitter  height,  strength  of  the  gradient,  and  the  initial  propagation  direction.  If  the  height 
of  the  trapping  layer  minus  the  transmitter  height  is  less  than  the  radius  of  curvature  of  the  rays,  then 
some  of  the  rays  will  reach  the  top  of  the  trapping  layer.  When  this  happens,  the  curvature  abruptly 
changes  and  the  ray  diverges  from  the  surface  and  escapes  to  space.  Unless  the  relative  curvature 
above  the  trapping  layer  is  zero,  there  will  be  a wedge-shaped  region  in  which  it  is  not  possible  to 
propagate  any  ray  originating  from  the  transmitter.  This  “shadow  zone”  is  indicated  in  the  figure. 
The  existence  of  a shadow  zone  implies  a region  in  which  a target  can  exist  in  which  there  is  no 
illumination  by  the  transmitter.  In  a radar  system,  no  illumination  implies  no  detection.  The 
confinement  of  radiation  in  the  “duct”  will  often  lead  to  lower  propagation  losses  than  expected  from 
a \!R^  spreading. 

A different  situation  exists  if  the  transmitter  (or  receiver)  is  above  the  surface  duct  (see 
Figure  4-11).  Radiation  approaching  the  top  of  the  duct  at  too  shallow  an  angle  will  be  refracted 
away  before  it  reaches  the  top  of  the  trapping  layer.  Radiation  approaching  at  steeper  angles  will 
penetrate  the  duct.  These  rays  are  then  rapidly  bent  down  until  they  strike  the  surface.  After 
reflection  they  will  not  be  refracted  sharply  enough  to  prevent  them  from  leaving  the  duct  at  which 
point  they  are  rapidly  refracted  out  to  space.  In  this  example  it  is  apparent  that  there  is  a shadow 
zone  inside  the  duct  just  as  there  is  a shadow  zone  outside  the  duct  for  a transmitter  within  the  duct. 
Any  object  in  the  shadow  zone  is  incapable  of  transmitting  to  or  receiving  a signal  from  the 
transmitter/receiver. 


Figure  4-11.  Penetration  of  a surface  duct  by  a transmitter  above  the  duct. 
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The  ducts  in  Figures  4-10  and  4-11  act  via  a combination  of  reflection  and  refraction.  If  the 
trapping  layer  is  elevated,  as  shown  in  Figure  4-12,  then  reflection  no  longer  plays  a role. 
Nevertheless,  a duct  can  be  formed  using  only  refraction.  In  the  trapping  layer  there  is  strong 
downward  curvature.  Just  below  the  trapping  layer  there  is  upward  curvature.  The  magnitude  of 
the  downward  curvature  is  independent  of  the  magnitude  of  the  upward  curvature.  Consider  an 
emitter  within  the  duct  and  near  the  bottom  of  the  trapping  layer.  Rays  emitted  upward  are  curved 
downwards  by  the  trapping  layer;  rays  emitted  downward  are  curved  upward  by  the  “normal”  layer 
below  the  trapping  layer.  As  soon  as  an  upward  traveling  ray  is  bent  back  toward  the  surface  enough 
that  it  exits  the  bottom  of  the  trapping  layer,  it  experiences  a different  curvature  and  starts  to  bend 
upward  away  from  the  surface.  When  it  reenters  the  trapping  layer,  the  curvature  reverses  and  the 
cycle  begins  again.  The  analogous  process  happens  to  most  downward  traveling  rays.  Once  again 
we  have  a situation  in  which  periodic  oscillation  occurs  with  minimal  spreading  loss  in  the  vertical 
direction. 

If  an  initially  upward  traveling  ray  exits  the  trapping  layer  before  it  can  be  completely 
redirected  downward,  then  that  ray  will  escape  to  infinity.  If  an  initially  downward  traveling  ray  has 
enough  downward  motion  to  carry  it  beyond  the  lower  boundary  of  the  duct  (defined  by  a 
refract!  vity  equal  to  that  of  the  minimum  value  at  the  top  of  the  duct),  it  will  either  strike  the  surface 
and  reflect  or  it  will  reverse  direction  and  curve  upward.  In  either  case,  the  ray  has  enough  upward 
direction  that  it  will  go  past  the  top  of  the  trapping  layer  and  escape.  A shadow  zone  exists  in 
elevated  ducts  as  well  as  surface  ducts. 


Figure  4-12.  Trapping  by  an  elevated  negative-gradient  layer. 
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Figures  4-10  through  4-12  are  drawn  assuming  only  linear  gradients.  In  this  case  all 
curvatures  are  constant  and  it  is  easily  to  trace  the  rays.  However,  as  evidenced  by  the  profde  of  an 
evaporation  duct,  Eq.  (4.39),  linear  gradients  are  the  exception  not  the  rule.  Although  the 
instantaneous  curvature  is  always  given  by  Eq.  (4.40),  if  the  gradient  is  a function  of  height,  then 
the  curvature  will  be  a function  of  height.  Tracing  of  ray  paths  will  be  more  difficult.  There  are  a 
number  of  approaches  to  determining  the  flow  of  radiation  in  a ducting  environment.  One  is  to 
analytically  solve  the  differential  equations.  This  is  practical  only  for  certain  idealized  profiles.  A 
second  approach  is  to  break  the  atmosphere  into  many  layers  assigning  a discrete  refractive  index 
to  each  layer.  A ray  may  then  be  traced  by  applying  Snell’s  law  to  each  interface  it  encounters. 
When  the  angle  of  incidence  of  the  ray  approaches  90°  to  within  some  very  tight  tolerance,  then  the 
ray  is  reflected.  This  “finite  element”  or  “finite  difference”  approach  is  schematically  illustrated 
in  Figure  4-13.  In  this  figure  we  have  highlighted  the  fact  that  propagation  velocity  is  inversely 
proportional  to  refractive  index.  Thus  rather  than  having  a local  maximum  of  refractive  index  (or 
refractivity)  define  the  duct,  a local  minimum  of  propagation  velocity  is  used  instead.  This  should 
give  the  reader  a hint  that  we  will  see  this  material  again  (when  we  discuss  propagation  of  acoustic 
radiation).  Light,  like  sound,  is  “lazy”  and  seeks  to  move  at  the  minimum  allowable  velocity. 
Wherever  possible,  sound  or  light  rays  oscillate  about  the  velocity  minimum  and  remain  in  the  duct. 
A third  approach  is  also  a finite  element  or  finite  difference  approach.  In  this  approach  the 
atmosphere  is  layered,  but  each  layer  is  assigned  a finite  curvature.  Each  ray  is  propagated  from 
layer  to  layer  along  piecewise  circular  paths. 

Although  the  first  method  can  only  be  used  occasionally,  there  is  one  profile  for  which  exact 
results  can  be  obtained:  the  parabolic  index  profile 

n(r)  -tIq-  0.5«2^^  (4.41) 

Assume  that  this  profile  is  symmetric  about  the  z-axis.  If  we  restrict  the  radial  distance  to  small 


Figure  4-13.  Finite  element  (finite  difference)  approach  to  ray  path  calculation. 
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values,  we  have  the  paraxial  approximation.  The  position  of  any  ray  can  be  expressed  as 

r — r[z)p  + z , (4.42) 

where  p is  a unit  vector  perpendicular  to  the  z-axis.  The  equation  describing  a curved  ray  can  be 
shown  to  be  [6], [7] 


d ( dr\ 
— n — 
ds\  ds ) 


= Vn 


(4.43) 


where  s is  the  distance  along  the  ray.  In  the  paraxial  approximation  this  becomes 


d^r  _ 1 dn(r) 

dz^  n(r)  dr 


Now  from  the  parabolic  index  profile  we  can  extract 


dn 

— = —n.r 
dr  ' 


Substituting  this  expression  into  Eq.  (4.44)  we  have 


d^r  fij 

— T ^ - — 
dz 


(4.44) 


(4.45) 


(4.46) 


which  has  sine  and  cosine  solutions.  Specifically,  if  rg  is  the  initial  value  of  the  radial  displacement 
and  Vg  is  the  initial  slope  of  the  ray,  then  the  solutions  can  be  written  as 


(4.47) 


The  parabolic  index  profile  is  often  a better  approximation  to  a real  profile  than  is  the  constant 
gradient  profile  assumed  in  our  earlier  analyses.  It  also  finds  real  application  in  gradient  index  fiber 
optics.  Figure  4-14  shows  an  elevated  duct  with  a hypothetical  parabolic  index  profile.  All  rays 
have  the  same  frequency  so  the  pattern  alternately  focuses  and  defocuses  as  it  propagates  down  the 
duct.  We  will  see  such  refocusing  behavior  again  when  we  discuss  convergence  zones  in  acoustic 
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Figure  4-14.  Trapping  in  an  elevated  duct  with  a parabolic  index  profile. 
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propagation.  The  spatial  repeat  frequency  in  this  instance  is 

A:  = («2 • (4.48) 

There  are  a limited  number  of  computer  codes  available  to  assist  in  calculating  the  effects 
of  refractive  propagation  on  sensor  systems.  One  of  the  more  widely  available  codes  is  EREPS 
(Engineer’s  Refractive  Effects  Prediction  System). [5]  In  addition  to  the  duct  database  (SDS) 
contained  in  the  code,  EREPS  has  several  parts:  a ray  tracing  program  (RAYS),  path  propagation 
loss  programs  (PROPR  and  PROPH),  and  a graphical  program  (COVER)  for  predicting  spatial 
detection  coverage.  EREPS  was  intended  to  assist  users  in  assessing  the  impact  of  lower  atmosphere 
propagation  effects  on  radars,  electronic  warfare  systems,  and  communications  links.  It  is  applicable 
over  the  spectral  range  from  100  MHz  to  20  GHz.  The  models  account  for  multipath  (see  Chapter 
5),  diffraction,  refraction,  evaporation  and  surface-based  ducts,  water  vapor  absorption,  and  sea 
clutter  for  horizontally-homogeneous  atmospheric  conditions.  The  ray  tracing  program  (RAYS) 
takes  a user-defined  refractivity  profile  (consisting  of  up  to  1 4 regions  of  linear  refractivity  gradient) 
and  uses  repeated  application  of  Snell’s  law  to  project  the  ray  path. 

All  of  the  preceding  effects  occur  in  the  microwave  region  of  the  spectrum.  In  the  visible 
and  infrared  region  of  the  spectrum,  the  same  ducting  and  other  refractive  phenomena  occur. 
However,  there  are  some  differences.  Most  of  these  have  to  do  with  the  sources  of  refractive 
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anomalies.  In  the  microwave  region  we  found  that  water  vapor  is  the  driving  force  behind 
anomalous  refraction.  In  the  visible  and  infrared  we  shall  see  that  water  vapor  variations  are  much 
less  significant  a driver  than  are  temperature  variations. 

Close  examination  of  Eq.  (4.2)  and  (4.3)  shows  that  in  the  visible  and  infrared,  the  sensitivity 
of  N to  temperature  changes  (AA^^ per  Kelvin)  is  roughly  four  times  the  sensitivity  to  water  vapor  (AN 
per  mbar  H2O).  For  example 


dN 

dT 


/7^=1013mbar; 


-82190 

rji2 


(4.49) 


versus 


dN 


^Ph20 


p^=1013mbar;  r=273.15K;  X=\fim 


« -0.254 . 


(4.50) 


This  is  a serious  difference  between  the  visible  and  the  microwave  region.  Thus,  reffactivity 
gradients  are  much  more  likely  to  be  driven  by  temperature  gradients  than  by  water  vapor  gradients. 
Subrefractive  and  superrefractive  (even  trapping)  layers  will  still  exist  but  they  will  be  caused  by 
different  circumstances. 

A negative  temperature  gradient  is  commonly  produced  over  hot  surfaces  (the  surface  is 
hotter  than  the  air  temperature)  and  will  produce  a strongly  positive  refractivity  gradient.  This 
produces  subrefractive  behavior.  A subrefractive  atmosphere  will  produce  inferior  mirages. [8]  In 
an  inferior  mirage,  an  elevated  object  appears  lower  than  normal.  An  inferior  mirage  in  which  the 
sky  is  viewed  as  being  “lower”  than  the  surface  of  a highway  (and  is  invariably  perceived  as  water 
on  the  road)  is  perhaps  the  most  commonly  observed  manifestation  of  an  inferior  mirage.  Other 
manifestations  can  appear  as  terrain  features  being  “reflected”  by  a flat  surface  such  as  water.  The 
refractive  behavior  leading  to  an  inferior  mirage  is  illustrated  in  Figure  4-15. 

A positive  temperature  gradient  (a  temperature  inversion)  can  be  produced  over  cold 
surfaces.  If  the  gradient  is  strong  enough  then  superrefractive  or  even  trapping  behavior  occurs. 
Super-refractive  conditions  produce  superior  mirages. [8]  Less  common  than  inferior  mirages, 
superior  mirages  can  be  much  more  striking.  Objects  can  appear  to  float  in  the  air.  Multiple  images 
of  an  object  can  appear;  some  of  which  are  inverted  or  grossly  distorted.  Sometimes  small  surface 
height  variations  are  magnified  into  huge  pillars  that  can  be  mistaken  for  structures  or  forbidding 
terrain  features.  These  “Fata  Morgana”  are  awe-inspiring  examples  of  superior  mirages.  The 
refractive  conditions  that  can  produce  a simple  superior  mirage  are  shown  in  Figure  4-16.  More 
complicated  mirages  result  when  several  superrefractive  and  normal  or  subrefractive  layers  occur 
at  different  altitudes. 
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Atmospheric  Turbulence 

Atmospheric  turbulence  has  two  different  but  somewhat  related  definitions.  The  first 
definition  relates  to  medium-scale  atmospheric  motions  more  properly  called  wind  shears. 
“Turbulence”  produced  by  wind  shear  is  the  effect  felt  by  airline  passengers  as  their  aircraft 
encounters  spatially  adjacent  updrafts  and  downdrafts.  The  rapidly  changing  accelerations  produced 
by  the  winds  are  felt  as  bumps,  lurches,  and/or  violent  shaking.  The  atmospheric  motions  may  be 
produced  by  airflows  deflected  by  mountains,  waves  at  the  boundaries  of  jet  streams,  circulation 
associated  with  storms,  or  rising  columns  of  solar  heated  air  interspersed  with  the  descending 
currents  of  colder  air  required  to  maintain  rough  atmospheric  pressure  equilibrium.  These  columnar 
motions  are  the  largest  scales  of  the  phenomenon  that  affects  sensor  systems.  The  second  definition 
of  atmospheric  turbulence,  which  is  the  one  we  are  concerned  with  in  this  section,  is  the  turbulent 
mixing  of  eddies  of  air  with  slight  temperature  differences  (typically  AT  ~ 1 K).  The  temperature 
changes  produce  slight  refractive  index  variations  (An  ~ 1 0'^)  which  in  turn  produce  random  lensing 
and  interference  effects  in  electromagnetic  radiation. 

Atmospheric  turbulence  is  produced  by  several  effects.  However,  the  strongest  driver  is 
solar  heating  of  the  ground.  Sunlight  heats  the  ground  to  temperatures  that  may  be  considerably 
above  the  ambient  air  temperature.  It  is  not  uncommon  for  the  ground  temperature  to  exceed  60  °C 
and  possible  for  it  to  exceed  90  °C  under  conditions  of  the  strongest  solar  irradiance.  The  hot 
surface  heats  the  air  in  immediate  contact.  This  superheated  air  rises  rapidly.  Consider  air  that  is 
heated  only  30  °C  hotter  than  the  ambient  air  temperature.  If  we  consider  average  lapse  rate 
conditions  (6  °C  per  km),  the  adiabatic  cooling  of  a rising  column  of  air  that  was  originally  30  °C 
above  ambient  will  reach  the  same  temperature  as  the  surrounding  air  at  an  altitude  of  10  km.  Thus 
we  see  that  the  solar  heated  air  can  easily  rise  up  to  the  tropopause.  If  the  heated  air  rises,  some 
amount  of  colder  air  must  fall  to  replace  the  rising  hot  air  and  prevent  formation  of  a vacuum.  The 
cold  air  cannot  diffuse  through  the  rising  hot  air  so  it  tends  to  form  descending  columns.  Similarly 
the  hot  air  will  be  channeled  into  rising  columns.  These  columns  will  be  initially  be  spatially 
distinct  and  adjacent  to  each  other. 

However,  there  is  a hydrodynamic  phenomenon  known  as  the  two-stream  instability  that 
rapidly  affects  both  the  rising  and  falling  columns.  The  two-stream  instability  occurs  when  a stream 
of  fluid  with  one  density  passes  through  a fluid  with  a different  density.  In  this  instance,  any 
perturbation  on  the  surface  of  one  stream  with  grow  exponentially  in  time.  The  growth  of  these 
perturbations  ultimately  causes  the  stream  to  break  up  into  globules.  The  most  common  visual 
manifestation  of  two-stream  instability  is  found  in  a common  garden  hose.  When  a strong  stream 
of  water  is  emitted  from  the  hose,  it  is  initially  a smooth  cylinder  of  water.  However,  after  only  a 
few  feet,  the  smooth  cylinder  breaks  up  into  dozens  of  droplets  of  varying  sizes.  This  is  the  result 
of  the  two-stream  instability  (the  water  is  one  stream,  the  air  constitutes  the  other).  The  same  effect 
causes  the  rising  hot  air  columns  and  the  falling  cold  air  columns  to  break  up  into  irregularly  shaped 
“turbules”  of  hot  air  and  cold  air.  Some  of  these  turbules  interact  with  each  other  and  break  up  into 
smaller  and  smaller  turbules.  The  process  of  interaction  imposes  a degree  of  random  motion  onto 
the  dominant  streaming  motion,  while  winds  will  impart  lateral  motions  to  the  turbules. 
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Any  condition  which  produces  air  hotter  than  the  ambient  temperature  will  produce 
turbulence.  Direct  solar  heating  produces  strong  turbulence.  However,  even  at  night,  the  thermal 
inertia  of  the  ground  may  keep  it  warmer  than  the  ambient  air.  This  will  also  produce  turbulence. 
However,  the  strength  of  the  turbulence  depends  on  the  maximum  AT  that  is  produced.  Nighttime 
turbulence  is  almost  never  as  strong  as  that  produced  by  solar  heating.  By  the  same  token,  if  the  sun 
temporarily  goes  behind  a cloud,  the  direct  solar  heating  is  reduced  and  the  turbulence  strength  will 
fall  dramatically.  On  overcast  days,  turbulence  strength  may  not  exceed  nighttime  levels.  Fires  or 
other  strong  heat  sources  will  produce  intense  turbulence  although  only  above  the  region  that  is 
burning  or  has  recently  burnt.  Turbulence  over  water  is  usually  much  weaker  than  turbulence  over 
the  land.  Ground  absorbs  almost  all  of  the  solar  radiation  right  at  the  surface;  but  sunlight  can 
penetrate  meters  into  water.  The  sea  surface  is  not  strongly  heated  by  sunlight,  so  there  is  no  strong 
temperature  difference  produced.  Turbulence  over  water  is  typically  less  than  nighttime  turbulence 
over  land. 

The  interaction  of  wind  with  terrain,  vegetation,  and  structures  will  also  produce  atmospheric 
turbulence.  Dynamic  (wind-driven)  turbulence  is  typically  weaker  and  less  persistent  than  solar- 
driven  turbulence.  The  eddies  produced  by  dynamic  turbulence  do  not  have  significant  temperature 
differences,  just  pressure  differences.  The  pressure  disturbances  will  equalize  more  quickly  (decay 
times  are  typically  less  than  a minute  for  disturbances  produced  by  structures  smaller  than  a few  tens 
of  meters)  than  the  temperature  differences.  For  this  reason  dynamic  turbulence  is  not  significant 
at  altitudes  much  above  the  altitude  of  the  highest  up-wind  obstructions.  It  is  also  not  significant 
if  the  optical  paths  of  interest  lie  more  than  a few  hundred  meters  downwind  from  vegetation  sources 
of  turbulence  (forested  areas).  High-speed  flow  around  airborne  platforms  can  become  turbulent. 
The  initially  laminar  flow  along  the  airframe  can  become  unstable  and  breakup.  Such  boundary 
layer  turbulence  is  limited  in  thickness,  but  its  location  right  at  sensor  apertures  on  board  the 
platforms  can  cause  problems.  Because  of  the  complexity  of  the  problem  and  its  analysis  we  will 
not  address  boundary-layer  turbulence  beyond  the  mention  of  its  potential  impact. 

Turbulent  eddies  will  range  in  size  from  millimeters  (the  so-called  “inner  scale”)  to  tens  or 
hundreds  of  meters  (the  so-called  “outer  scale”).  The  larger  scales  only  become  apparent  at  heights 
comparable  to  their  diameters.  Convective  mixing  coupled  with  adiabatic  lapse  causes  the 
turbulence  to  diminish  in  strength  as  the  altitude  increases.  We  will  return  to  the  altitude  depend- 
ence of  turbulence  in  a few  paragraphs.  Drift  dominates  the  evolution  of  the  atmospheric  eddies. 
Convective  rise  or  wind  produces  this  drift.  As  a consequence,  the  coherence  time  of  the  atmo- 
sphere as  viewed  by  a typical  sensor  is  of  the  order  of  milliseconds.  That  is,  the  ensemble  of 
turbules  viewed  by  a sensor  at  one  instant  of  time  will  be  completely  different  from  the  ensemble 
of  turbules  viewed  by  that  sensor  a few  milliseconds  earlier  or  a few  milliseconds  later. 

Atmospheric  turbulence  affects  electromagnetic  fields  during  the  propagation  of  those  fields. 
The  Huygens-Fresnel  equation  (Eq.  (2.58))  describes  the  propagation  of  electromagnetic  fields  under 
the  influence  of  diffraction.  To  describe  the  effects  of  turbulence  on  propagation,  we  need  to  modify 
(extend)  the  Huygens-Fresnel  equation.  The  unmodified  equation  has  the  form 
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(4.51) 
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To  accommodate  the  effects  of  turbulence  we  add  another  term  to  the  equation  and  obtain  the 
Extended  Huygens-Fresnel  Equation 
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X exp[  j(xo  ,y^,x,y)  + i ^(xq  , Jq  , x,  j)]} 


This  added  term  has  a real  part  and  an  imaginary  part.  The  argument  of  the  real  part  x is  called  the 

log-amplitude  ( is  an  amplitude  factor,  so  x is  the  log  of  the  amplitude  - the  log-amplitude).  The 
imaginary  part  acts  like  any  other  phase  term  (typically  an  aberration  or  distortion)  in  the  diffraction 
equation.  The  effects  of  the  two  different  parts  of  the  turbulence  extension  will  become  apparent 
shortly. 


Atmospheric  turbulence  can  affect  sensors  in  a number  of  ways.  The  potential  effects  of 
atmospheric  turbulence  on  sensor  performance  are  listed  in  T able  4-3 . Of  these  effects,  scintillation 


Table  4-3.  Potential  effects  of  atmospheric  turbulence  on  sensor  performance. 

SCINTILLATION 

* TRANSMITTER-TO-TARGET  PATH 

* TARGET-TO-RECEIVER  PATH 
TRANSMIT  BEAM  SPREAD 

* INSTANTANEOUS  BEAM  BREAKUP 

* BEAM  WANDER 
RECEIVER  ANGULAR  SPREAD 

* FIELD  OF  VIEW  MISMATCH 

* lAAAGE  SPECKLE 

* lAAAGE  WANDER 
RECEIVER  COHERENCE  LOSS 
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is  related  to  the  log-amplitude.  Beam  spread,  angular  spread,  and  coherence  loss  are  associated  with 
the  turbulent  phase  term. 

One  of  the  most  visible  manifestations  of  turbulence  is  scintillation  (fluctuations  in  intensity 
of  a source).  The  twinkling  of  stars  in  the  early  evening  is  the  effect  of  scintillation  on  the  human 
visual  sensor.  Scintillation  can  occur  on  any  propagation  path  through  the  atmosphere.  If 
scintillation  occurs  on  the  transmitter-to-target  path  then  the  illumination  may  fluctuate  strongly 
from  point  to  point  on  the  target.  If  the  target  has  a few  strong  scatterers,  the  turbulent  fluctuations 
in  the  illumination  of  those  scatterers  may  overwhelm  the  fluctuations  induced  by  the  target  alone. 
If  the  target  is  composed  of  many  small  scatterers,  there  may  be  some  averaging  of  the  turbulent 
fluctuations  by  the  target.  If  the  fluctuations  occur  on  the  return  path,  they  will  add  to  the  target 
fluctuations  and  cannot  be  averaged  over  the  aperture. 

Phase  perturbations  of  the  radiation  field  cause  beam  spread,  receiver  angular  spread,  and 
coherence  loss.  A coherent  beam  of  radiation  can  undergo  spreading  due  to  turbulence  that  is 
greater  than  that  produced  by  diffraction.  The  total  beam  spreading  will  have  three  components:  a 
component  due  to  diffraction,  an  instantaneous  breakup  of  the  beam  into  high-field  and  low-field 
regions,  and  a slower  beam  wander.  Beam  wander  and  beam  breakup  will  be  roughly  comparable 
in  magnitude  to  each  other.  That  is,  the  range  of  angles  through  the  beam  wanders  will  be  roughly 
equal  to  the  turbulent  angular  spread  observed  in  the  broken  up  beam.  The  breakup  pattern  will 
change  on  tens  of  milliseconds  time  scales,  the  same  time  scale  as  the  beam  wander. 

Beam  spreading  results  from  angular  phase  perturbation  imposed  on  a transmitted  beam  by 
turbulence  acting  near  the  source  of  the  beam.  Receiver  angular  spread  results  from  angular  phase 
perturbations  imposed  on  a received  beam  by  turbulence  acting  near  the  receiver.  The  high  spatial 
frequency  components  (which  correspond  to  those  causing  beam  breakup  on  transmit)  act  to  smear 
the  received  signal  over  a large  image  area  into  a random  pattern  of  bright  areas  and  dark  areas  - 
a speckle  pattern.  The  low  spatial  frequency  components  act  to  cause  the  speckle  pattern  to  wander 
about  the  image  plane  (the  received  signal  analog  of  transmit  beam  wander).  Both  effects  can  cause 
some  of  the  signal  to  miss  the  area  delineated  by  the  detectors.  This  effect  called  field  of  view 
mismatch  results  in  reduced  signal  strengths. 

A final  phase  perturbation  effect  can  occur  in  coherent  detection  (heterodyne)  systems.  The 
random  phase  variations  imposed  on  the  signal  will  reduce  the  spatial  coherence  of  that  signal  and 
reduce  the  efficiency  with  which  a smooth  local  oscillator  can  mix  with  the  signal.  It  is  not  possible 
to  predict  the  phase  variations  in  advance,  so  it  is  not  possible  to  match  the  local  oscillator  to  the 
signal,  which  is  necessary  for  efficient  heterodyne  conversion. 

Both  the  phase  and  amplitude  perturbations  can  be  characterized  as  Gaussian  processes 
whose  magnitude  depends  on  a quantity  C/,  the  refractive  index  structure  coefficient.  The 
refractive  index  structure  coefficient  describes  the  spatial  variability  of  the  refractive  index.  Because 
the  refractive  index  variations  are  caused  by  temperature  variations,  measurements  are  more 
common  made  of  the  temperature  structure  coefficient  Cj  defined  by 
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with  units  of 


(4.53) 


{T(r,)-T(7,f 

'^T  - _|2/3 

h-nl 

where  r(r,)  is  the  temperature  at  position  r,  and  the  < > brackets  denote  an  ensemble  average.  Cj 
is  determined  just  as  it  is  defined,  by  making  simultaneously  temperature  measurements  at  two 
points  and  carrying  out  the  averaging  over  time.  The  temperature  structure  coefficient  is  related  to 
the  refractive  index  structure  coefficient  by  the  relation 
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Upon  examination  of  Eq.  (4.2)  for  the  refractive  index  it  can  be  shown  that  the  temperature 
derivative  takes  the  form 

AU)x\0-^  , 

- ^ \ units  of  K (4.55) 

dT 

where  ^(T)  varies  from  approximately  86  at  a visible  wavelength  of  500  nm  to  somewhat  less  than 
80  at  an  infrared  wavelength  of  1 0 pm.  The  dispersion  of  refractive  index  in  air  can  almost  always 
be  considered  negligible  and  a single  value  of  ^ (a  choice  ^ = 83  is  no  better  or  no  worse  than  any 
other  choice  in  the  range  of  80-86)  selected. 

Limited  data  has  been  obtained  on  the  temporal  and  spatial  variability  of  the  refractive  index 
structure  coefficient,  although  there  is  enough  to  make  working  generalizations.  Table  4-4  shows 
data  collected  at  3m  height  above  ground  in  upstate  New  York  during  the  months  of  February  to 
May.  [9]  A number  of  samples  were  obtained  each  hour  of  every  day.  Average  daytime  turbulence 
levels  (as  characterized  by  C^^)  are  seen  to  be  around  10  '^  Average  nighttime  values  are 

roughly  10  '"*  m'^^^  The  units  of  the  structure  coefficient  appear  strange.  Nevertheless,  if  the  values 
associated  with  these  units  are  inserted  into  any  of  the  equations  that  follow,  then  the  calculated 
results  will  have  the  correct  units.  The  values  shown  in  Table  4-4  are  consistent  with  other 
measurements,  summarized  in  Reference  10. 

Theoretical  work  has  shown  that  solar-driven  turbulence  should  have  an  altitude  dependence 
that  allows  to  decrease  as  the  altitude  to  the  -4/3  power.  [1 1]  In  the  absence  of  the  solar  driver, 
the  turbulence  should  rapidly  dissipate  to  lower  overall  strengths  and  with  a less  pronounced  falloff 
with  altitude.  The  data  in  Table  4-4  support  this  conclusion.  Type  I statistics  are  fair  weather,  clear 
sky  data  - days  where  a prominent  solar  driver  is  anticipated.  The  all  weather  statistics  includes 
overcast  days  and  days  with  rain  and/or  snow.  Note  that  the  all-weather  values  are  closer  to 
nighttime  levels  than  to  the  Type  I daytime  levels. 
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Table  4-4.  Monthly-averaged  turbulence  statistics  collected  at  3m  height. 


LOG  Cp  STATISTICS 
(TYPE  1) 

LOG  Cp  STATISTICS 
(ALL  WEATHER) 

MEAN 

STD.  DEV. 

MEAN 

STD.  DEV. 

FEB 

NIGHT 

-14.13 

0.68 

-14.12 

0.68 

DAY 

-12.76 

0.41 

-13.63 

0.76 

MAR 

NIGHT 

-14.33 

0.60 

-14.05 

0.75 

DAY 

-12.70 

0.52 

-13.12 

0.82 

APR 

NIGHT 

-13.67 

0.43 

-13.64 

0.51 

DAY 

-12.79 

0.62 

-12.82 

0.63 

MAY 

NIGHT 

-13.49 

0.54 

-13.54 

0.57 

DAY 

-12.56 

0.37 

-12.85 

0.63 

TYPE  I:  CLEAR  SKY;  FAIR  WEATHER 


In  the  majority  of  his  work  on  turbulent  effects  the  author  has  used  the  following  general 
values  for  estimating  turbulence  strength: 


DAY 

3m  Height,  Average  Terrain,  Clear  Weather 

(3  m)  = 10’^ 

DAY 

3m  Height,  Desert,  Clear  Weather 

Cp^  (3  m)  = 10  ’^  m'^^^ 

DAY 

3m  Height,  Any  Terrain,  Adverse  Weather 

Cp^  (3  m)  = 10’^  m-^^3 

DAY 

3m  Height,  Over  Water,  Any  Weather 

Cp^  (3m)  = 10  ’^  m'^^^ 

NIGHT 

3m  Height,  Any  Terrain,  Any  Weather 

Cp^  (3m)  = 10  ’"‘  m'^^^ 

NIGHT 

3m  Height,  Over  Water,  Any  Weather 

Cp^  (3  m)  = 10'®  m-^^3 

The  author  also  uses  the  following  altitude  dependences  below  100  m: 


DAY 

Any  Terrain,  Clear  Weather 

Cp^  (h)  = Cp^  (3m)  (h/3)-^^^ 

DAY 

Any  Terrain,  Adverse  Weather 

Cp^  (h)  = Cp^  (3m)  (h/3)-^^^ 

DAY 

Over  Water,  Any  Weather 

Cp^  (h)  = Cp^  (3m)  (h/3)-^^^ 

NIGHT 

Any  Terrain,  Any  Weather 

Cp^  (h)  = Cp^  (3m)  (h/3)-^^^ 
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and  the  following  altitude  dependence  above  100  m:[12] 

Cl(h)  = 8.2  X x (4.56) 

where  U is  the  root-mean-squared  wind  speed  averaged  over  the  5000-20000  m altitude  regime.  U 
has  a mean  value  of  27  m/s  and  a standard  deviation  of  9 m/s.  The  altitude  dependence  of  Eq.  (4.56) 
is  technically  only  valid  above  1000  m or  so,  however,  extrapolation  to  lower  altitudes  leads  to  a 
result  almost  equal  to  the  extrapolations  of  the  lower  altitude  profdes  to  higher  altitudes.  In  the  1 00- 
1000  m altitude  regime,  the  differences  between  the  different  extrapolations  are  relatively  small. 
The  combined  dependence  is  shown  in  Figure  4-17.  With  this  knowledge  of  the  values  of  C/  and 
its  spatial  dependence,  we  can  proceed  to  look  at  specific  turbulence  effects. 

The  log-amplitude  % (after  exponentiation)  modifies  the  amplitude  of  the  electric  field  and 
consequently  the  magnitude  of  the  intensity 

I = (4.57) 

Fluctuations  in  the  log-amplitude  translate  into  intensity  fluctuations.  These  fluctuations  are  called 


Figure  4-17.  Generic  altitude  dependence  of  atmospheric  turbulence. 
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scintillation.  Figure  4-18  shows  the  effect  of  scintillation  on  a laser  beam  courtesy  of  J.  Shapiro  at 
M.I.T.  An  initially  uniform  intensity  beam  was  transmitted  at  night  from  the  roof  of  one  building 
to  the  roof  of  another  building  several  kilometers  distant.  The  turbulence  amplitude  modulated  the 
beam  with  a random  pattern  of  more  intense  regions  and  less  intense  regions.  The  peak-to-valley 
range  of  the  fluetuations  inereases  as  propagation  distance  increases  and  as  the  turbulence  strength 
(Cn^)  inereases.  Note  that  the  seintillation  is  apparent  in  both  the  focused  and  unfoeused  beams.  The 
beams  in  Figure  4-18  have  also  propagated  far  enough  to  show  signifieant  beam  spreading.  The 
foeal  spot  of  the  unspread  beam  would  have  been  only  slightly  larger  than  the  size  of  an  individual 
“speckle”. 

Turbulenee  tends  to  produce  fluetuations  in  the  log-amplitude  that  are  Gaussianly  distributed 

pAx)  = (2^0  e (4.58) 

with  mean  value 

{x)  = (4.59) 

and  variance 

var(;i^)  = <j\  (4.60) 


Figure  4-18.  Photographs  of  seintillation  on  a laser  beam  taken  at  the  aperture 
of  a colleeting  telescope  and  at  the  focal  plane  of  that  teleseope. 

Aperture  Focus 
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The  key  quantity  is  called  the  log-amplitude  variance.  It  can  be  calculated  using  the  equation 

(4-61) 

where 

k = lnlX  (4.62) 

In  this  equation  we  have  defined  a normalized  path  length 

= z!  R = Normalized  distance  along  path  (4.63) 

This  permitted  all  dimensioned  quantities  except  the  refractive  index  structure  coefficient  to  be 
brought  outside  the  integral.  The  wavelength  and  path  length  dependence  become  explicit.  This 
normalization  will  be  used  for  all  turbulence  calculation.  If  the  refractive  index  structure  coefficient 
is  constant  over  the  path,  then  the  equation  for  the  log-amplitude  variance  simplifies  to 

= 0.124  Cl  k^'^  R'''^  (4.64) 

When  scintillation  fluctuations  become  large  enough  they  undergo  a phenomenon  called 
saturation  of  scintillation,  that  is,  even  though  continues  to  grow,  t7^  stops  growing  and  levels 

off  at  a value  of  roughly  0.5.  However,  by  the  time  saturation  of  scintillation  has  been  reached, 
turbulence  effects  are  highly  significant  and  sensor  performance  is  usually  severely  degraded.  An 
analysis  of  the  effects  of  scintillation  on  laser  radar  detection  performance  is  presented  in  Chapter 
1 1 . The  degrading  effects  of  scintillation  on  laser  radar  imaging  performance  are  analyzed  in 
Chapter  14.  In  light  of  these  results,  it  is  a worthwhile  objective  to  try  to  keep  the  log-amplitude 
variance  at  least  an  order  of  magnitude  below  saturation.  If  scintillation  is  not  saturated,  then  the 
spatial  intensity  correlation  distance  (scale  size)  of  the  scintillation  light  and  dark  patches  is  given 
by 

dc  = (XRf^ . (4.65) 

Figure  4-19  shows  the  turbulence  strength  dependence  of  the  log-amplitude  variance  at 
10.6  pm  over  a horizontal  path  (constant  C„^).  Figures  4-20  and  4-21  show  the  wavelength 
dependence  of  the  log-amplitude  variance  for  the  typical  daytime  and  nighttime  values  of 
turbulence.  It  should  be  noted  that  under  daytime  conditions  it  is  relatively  easy  for  the  log- 
amplitude  variance  to  become  saturated  over  horizontal  path  lengths  of  tactical  interest  to  sensor 
systems.  At  night,  the  reduced  turbulence  strength  permits  propagation  to  considerably  longer 
distances  before  saturation  occurs. 

Beam  spread  and  angle  of  arrival  fluctuations  are  characterized  by  the  atmospheric  coherence 
length  Pq.  The  coherence  length  depends  critically  on  whether  the  strongest  turbulence  is  near  the 
beginning  of  a propagation  path  or  near  the  end.  Thus,  we  can  define  one  coherence  length  for  paths 
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Figure  4-19.  Turbulence  strength  dependence  of  the  log-amplitude  variance. 
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Figure  4-20.  Wavelength  dependence  of  the  log-amplitude  variance  - 
typical  daytime  turbulence  strength  of  = 10  '^  m'^^^ 
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Figure  4-21.  Wavelength  dependence  of  the  log-amplitude  variance  - 
typical  nighttime  turbulence  strength  of  = lO  '"' 
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involving  transmitted  signals 


Pqt  ~ 


-1  5/3 

2.9\e  R\diCl(i)(\-C) 


-1-3/5 


(4.66) 


and  another  for  paths  involving  received  signals 


Pqr  ~ 


(4.67) 


Note  the  different  dependence  on  ^ in  the  integrals.  Eq.  (4.66)  weights  small  values  of  ^ more 
strongly  than  large  values,  while  Eq.  (4.67)  does  the  opposite.  The  atmospheric  correlation  length 
for  a path  is  the  transverse  separation  between  two  points  on  a beam  of  electromagnetic  radiation 
required  for  the  phase  of  the  radiation  to  become  uncorrelated.  That  is,  the  largest  patch  on  a beam 
that  can  be  considered  to  have  a uniform  phase  is  p„. 


Beam  spread  is  association  with  the  transmission  of  a beam.  Thus,  the  degree  of  beam 
spreading  will  depend  on  the  transmitted-path  correlation  length.  The  physical  half-angle  by  which 
the  beam  will  spread  due  to  turbulence  is  given  by 
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^ Bs  ~ 0-3  84 A / . 


(4.68) 


The  beam  will  also  spread  by  normal  diffraction  by  a half-angle  If  the  aperture  is  circular  with 
diameter  D and  uniformly  illuminated,  the  diffraction  beam  spread  can  be  found  from  the  Airy 


function  and  has  half-angle 

= 1.22A/Z).  (4.69) 

The  total  beam  spread  half-angle  will  be  given  by 

Turbulence  beam  spreading  will  equal  diffraction  beam  spreading  if 

(l.22/0.384)/?o^  = 3.181/?or  = D . (4.71) 

For  this  reason  the  quantity  rg  , defined  by 

ro  = 3.181/?o  (4.72) 


where  pg  is  either  the  transmitted-path  or  received-path  coherence  length,  as  appropriate,  is  called 
Fried’s  coherence  diameter. [13] 

On  received  paths,  the  atmospheric  phase  variations  will  cause  angle  of  arrival  variations. 
The  magnitude  of  these  variations  can  be  determined  from  the  equation 

d,,=03&4A/p,,.  (4.73) 

If  a horizontal  path  (uniform  C/)  is  assumed,  then  the  integrals  in  Eqs.  (4.66)  and  (4.67) 
simplify  and  give  a single  result  for  the  coherence  length 

Pot  = PoR  = . (4.74) 

Figure  4-22  shows  the  turbulence  strength  dependence  of  the  coherence  length  for  a horizontal  path 
and  a wavelength  of  10.6  pm.  Figures  4-23  and  4-24  show  the  daytime  and  nighttime  dependence 
of  the  coherence  length  on  wavelength.  Once  again  it  should  be  noted  that  rather  small  coherence 
lengths  (<  0.1  m)  are  obtained  at  moderately  short  ranges  (<  10  km)  in  a number  of  cases.  This 
means  that  turbulent  beam  spreading  might  adversely  affect  the  performance  of  any  sensor  under 
those  conditions.  Beam  spread  may  reduce  the  power  density  that  illuminates  the  target  or  the  power 
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Figure  4-22.  Turbulence  strength  dependence  of  the  horizontal  path  coherence  length  at  10.6  pm. 


Figure  4-23.  Wavelength  dependence  of  the  horizontal  path  atmospheric  coherence  length  - 
typical  daytime  turbulence  strength  of  = 10  '^  m'^^^ 
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Figure  4-24.  Wavelength  dependence  of  the  horizontal  path  atmospheric  coherence  length  - 
typical  nighttime  turbulence  strength  of  = lO  '"' 
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than  can  be  focused  onto  a detector.  It  will  reduce  the  effective  angular  resolution  of  the  sensor. 
If  the  sensor  uses  coherent  detection  and  the  coherence  length  becomes  smaller  than  the  receiver 
aperture,  then  the  random  turbulent  phase  variations  will  significantly  reduce  the  heterodyne  mixing 
efficiency. 

The  horizontal  path  results  close  to  the  ground  (3  m altitude)  are  essentially  worst  case 
predictions  for  real  sensor  systems.  Because  turbulence  strength  falls  off  with  increasing  altitude, 
any  path  characteristic  that  elevates  part  of  the  path  (even  by  a few  meters)  reduces  the  effects  of 
atmospheric  turbulence.  For  example,  few  places  in  the  world  are  perfectly  flat.  In  order  to  have 
a 10  km  line  of  sight,  the  observer  and  the  target  will  typically  have  to  be  on  the  tops  of  local  peaks. 
The  valley(s)  in  between  will  cause  the  over  those  valleys  to  be  lower  than  at  the  local  peaks. 

This  in  turn  will  cause  cr^  to  be  much  lower  and  to  be  much  larger  than  the  horizontal  path 

estimates.  A common  situation  is  when  either  the  target  or  the  sensor  is  elevated  on  a large  structure 
or  on  an  airborne  platform.  In  such  slant  paths,  one  end  of  the  path  is  high  above  the  surface,  even 
the  other  end  may  be  near  the  surface.  Nevertheless,  slant  paths  provide  significant  reductions  in 
turbulence  effects. 

The  impact  of  slant  paths  can  be  easily  calculated.  Consider  the  geometry  illustrated  in 
Figure  4-25.  If  we  use  the  previously  defined  normalized  path  distance  variable  then  the  daytime 
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Figure  4-25.  Geometry  of  a simple  slant  path. 


TARGET 


profile  of  turbulence  strength  with  altitude  can  be  written  as 


1,  h,-h 


n-4/3 


= C^,{H„)[i  + AC] 


-4/3 


(4.75) 


where  we  have  defined  a constant  A which  defines  the  slant  path 

A = AlzEs.  , (4,76) 

Ho 

All  of  the  path  dependence  is  now  contained  in  the  simple  quantity  [X+AC^Y'^.  In  a similar  fashion 
the  nighttime  turbulence  profile  can  be  written  as 


1, 


n-2/3 


= Cl(Ho){\  + A(] 


-2/3 


(4.77) 


Consider  first  daytime  turbulence  and  the  log-amplitude  variance.  Substituting  Eq.  (4.75) 
into  Eq.  (4.61)  and  rearranging  terms  we  obtain 


142 


cr^  = 0.56  CI[Hq)  [\+ ^5/6^j_^^5/6 

= 0.124 

r 1 n (4-V8) 

4516^1^  [\  + A^Y^'^  ^5/6^j_^^5/6 

= 0.124  Cl{Ho)  k'‘^  F,s{A) 

Note  that  all  of  the  slant  path  characteristics  can  be  factored  into  a simple  function  Fj^A)  which 
multiplies  the  horizontal  path  result.  The  path  function  Fjg  is  plotted  in  Figure  4-26.  To  determine 
the  impact  of  a slant  path  on  the  log-amplitude  variance,  all  the  user  needs  to  do  is  perform  a 
horizontal  path  calculation,  determine  the  relevant  value  of  A,  look  up  the  value  of  in  the 

figure,  and  multiply  the  horizontal  path  value  by  the  value  of  F;^.  Similar  calculations  can  be 
performed  for  each  of  the  other  turbulence  parameters. 


Figure  4-26.  Multipliers  used  to  convert  horizontal  path  results  into  slant  path  results. 
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The  daytime  slant-path,  transmitted-path  atmospheric  coherence  length  becomes 


Pqt 


2.91  Cl(H„)eR\'d<;[\  + AiY" 


-1-3/5 


= [\.09Cl(H„)eR]  X 

1 -|-3/5 

= [\mCl{H„)eRY'F,,{A) 

The  daytime  slant-path,  received-path  atmospheric  coherence  length  becomes 

“1-3/5 


PoR  — 


2.91  Cl{H„)  eRldi[\  + A(\'^’ 


= [\.WCl(Ho)eR]  X 
2.(>19\'d([\  + A^Y^^  C 


“1-3/5 


= \mCl(H„)k^RY'FYA) 


The  nighttime  slant-path  log-amplitude  variance  becomes 


= 0.56  Cl{Ho)  [\  + ^5/6^j_^^5/6 


= 0.124 

45\6\'d^[\  + 


(4.79) 


(4.80) 


(4.81) 
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= 0.124 

The  nighttime  slant-path,  transmitted  path  atmospherie  eoherence  length  becomes 


(4.81) 


Por 


2.91  Cl{H„)  k^Rld([\  + (l  -ff  ’ 


-3/5 


[\.09  Cl(Ho)eR]  X 
2.670{<[l  + .4^f'’  (1-^) 


-1-3/5 


(4.82) 


= \mCl{H^)eRY'F,,{A) 


while  the  nighttime  slant-path,  received-path  atmospheric  coherence  length  becomes 

n-3/5 


Pqr 


2.91  Cl{H„)  + 


= [l.09C;(/7„)t^/?px 

2.67o£</4'[1 + /!<■]■“  4'*" 


= [\mCl{H„)eRY'FjA)  (4.83) 

All  six  path  functions  F,s  through  F,s  are  plotted  versus  the  path  constant  A in  Figure  4-26.  As 
anticipated  even  relatively  small  elevations  of  one  end  of  the  path  can  significantly  reduce  the  effects 
of  turbulence.  The  figure  is  valid  for  slant  paths  with  Hj  >Hq.  The  same  expressions  and  curves 
can  be  used  if  Hq  >H^  by  using  the  absolute  value  of  A instead  of  A and  using  F2S  and  F55  for 
received  paths  and  and  F6S  for  transmitted  paths  (instead  of  the  other  way  around). 

Figures  4-27  through  4-29  explicitly  show  the  effects  of  slant  paths  on  the  log-amplitude 
variance,  transmit-path  atmospheric  coherence  length,  and  receive-path  atmospheric  coherence 
length  at  10.6  pm  and  assuming  daytime  turbulence. 
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Figure  4-27.  Effects  of  daytime  slant  paths  on  turbulence  log-amplitude  variance. 
Transceiver  altitude  is  2m;  wavelength  is  10.6  pm;  Cj,^(2m)  = 10'^^ 
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Figure  4-28.  Effects  of  daytime  slant  paths  on  transmitted-path  atmospheric  coherence  length. 
Transceiver  altitude  is  2m;  wavelength  is  10.6  pm;  Cj,^(2m)  = 10‘^^ 
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Figure  4-29.  Effects  of  daytime  slant  paths  on  received-path  atmospheric  coherence  length. 
Transceiver  altitude  is  2m;  wavelength  is  10.6  pm;  C„^(2m)  = 10  '^ 
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Ionospheric  Effects 

In  the  thermosphere,  not  only  does  the  kinetic  temperature  of  the  molecules  rise  dramatically, 
the  degree  of  ionization  increases  (although  the  ionization  is  not  produced  by  the  temperature  rise). 
Solar  x-radiation  from  solar  prominences  and  flares  and  extreme  ultraviolet  radiation,  blackbody 
radiation  plus  strong  photospheric  emission  lines  such  as  Lyman-a  (121.5  nm)  and  Lyman-P  (102.5 
nm)  from  hydrogen  atoms  as  well  as  lines  from  ionized  helium  in  the  solar  photosphere,  is  absorbed 
by  photoelectric  emission  from  the  oxygen  and  nitrogen  atoms  and  molecules  in  the  upper 
atmosphere.  The  resulting  ionization,  establishing  a thick  layer  of  the  upper  atmosphere  called  the 
ionosphere,  significantly  effects  low  frequency  radio  wave  propagation.[14]-[17] 

The  ionosphere  is  characteristically  divided  into  four  significant  layers:  the  D layer,  the  E 
layer,  the  F 1 layer,  and  the  F2  layer.  Obviously  from  the  nomenclature  there  are  several  other  layers 
of  minimal  significance.  The  vertical  structure  of  these  layers  is  illustrated  in  Figure  4-30. [15] 
There  is  considerable  variation  of  these  layers  with  time.  As  the  ionosphere  results  from  direct 
ionization  produced  by  sunlight,  we  would  expect  and  indeed  find  that  daytime  ionization  levels  are 
higher  than  nighttime  levels.  There  is  also  a weaker  dependence  on  time  of  year  (varying  distance 
from  the  sun  and  solar  elevation  angle  act  to  produce  seasonal  variations  in  ionization  just  as  they 
produce  similar  changes  in  the  weather  in  the  troposhpere.  Since  there  is  a substantial  contribution 
to  the  ionization  from  solar  x-rays,  this  produces  an  expected  variation  of  ionization  with  position 
relative  to  the  1 1 -year  sunspot  cycle.  At  solar  maximum  (maximum  sunspots  and  prominences) 
there  is  higher  overall  ionization  produced  than  at  solar  minimum.  All  three  variations  (diurnal, 
annual,  and  sunspot  cycle  are  shown  in  Figure  4-30. 

The  E layer  is  a narrowly  layer  at  approximately  110  km  altitude  that  appears  to  be 
dominantly  produced  by  direct  ionization  of  molecular  oxygen  by  ultraviolet  radiation  with 
wavelengths  below  103  nm.  Secondary  contributions  come  from  ionization  of  nitric  oxide  (NO). 
The  E layer  usually  nearly  diminishes  greatly  at  nighttime.  It  is  still  apparent  in  ionization  profiles 
but  its  peak  value  is  1%  of  its  daytime  levels.  Occasionally  a nighttime  E layer  appears,  when  it  is 
correlated  with  ionization  produced  by  meteorites  or  electron  showers. 

The  D layer  lies  below  the  E layer  and  is  formed  by  x-ray  and  Lyman-a  ionization  of  nitric 
oxide  (NO).  It  is  a weak  layer  that  almost  disappears  at  night.  Its  ionization  is  so  low  that  it  seldom 
affects  propagation  for  producing  an  anomalous  absorption  of  low-frequency  radiation  (below  10 
MHz). 


The  F layers  occur  above  the  E layer  and  can  extend  to  heights  of  several  hundred 
kilometers.  Electron  production  in  the  F layers  appears  to  result  from  ultraviolet  ionization  of 
atomic  oxygen  although  ionization  of  molecular  and  atomic  nitrogen  may  also  contribute.  The  FI 
layer  seldom  appears  as  a distinct  peak.  It  is  usually  a shoulder  or  a shelf  below  the  dominant  F2 
layer.  In  the  FI  layer,  recombination  of  electrons  with  ions  and  attachment  to  neutrals  are  the 
mechanisms  that  counter  the  production  rate  (and  keep  the  ionization  finite).  As  altitude  increases, 
the  collisions  necessary  for  “electron  loss”  become  increasingly  rare.  This  causes  the  F2  layer  in 
which  upward  drift  of  electrons  out  of  the  layer  balances  the  rate  of  production. 
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Figure  4-30.  The  vertical  structure  of  the  ionosphere  showing  diurnal,  annual,  and 
1 1 -year  solar  sunspot  cycle  variations  in  electron  concentration. 
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The  primary  significance  of  the  ionosphere  on  radio  wave  propagation  comes  from  its 
refractive  index.  The  refractive  index  of  an  electron-ion  plasma  is  given  by  the  equation 


2 

n - 

1- 

p_ 

V ’"y 

nl/2 


(4.84) 


where  vis  the  frequency  of  the  electromagnetic  radiation  and  vj,  is  the  electron  plasma  frequency. 
The  plasma  frequency  (in  Hz)  is  given  by 


2 A 


1/2 


y47T  mSo  j 


= 8.987V 


1/2 


(4.85) 


where  A(,  is  the  density  (electrons  per  cubic  meter)  of  free  electrons  in  the  plasma.  Note  that  an 
electron  density  of  1 0'^  m'^  produces  a plasma  frequency  of  roughly  9 MHz  while  an  electron  density 
of  10'®  m'^  produces  a plasma  frequency  of  roughly  900  kHz. 

At  very  high  frequencies  n is  approximately  1 and  the  ionosphere  can  have  little  effect. 
However,  as  v approaches  , the  refractive  index  starts  to  drop  rapidly.  When  v<  Vp  , then  n is 
pure  imaginary,  and  the  wave  cannot  propagate.  It  is  completely  reflected.  Consider  an  ionosphere 
in  which  the  electron  density  is  zero  at  some  altitude  h„  and  increases  steadily  to  a maximum  value 
at  an  altitude  of  Now  consider  an  electromagnetic  wave  with  a frequency  less  than  the 
plasma  frequency  at  the  maximum  ionization  point.  Let  this  wave  be  radiated  at  some  angle  to  the 
vertical.  As  the  wave  propagates  outward  below  the  ionosphere,  it  experiences  only  the  propagation 
phenomena  normally  present  in  the  troposphere  (which  we  will  assume  are  negligible).  As  the  wave 
encounters  the  ionosphere,  it  encounters  a medium  with  a negative  index  gradient  (as  the  electron 
density  increases,  the  refractive  index  decreases).  This  is  a superrefractive  gradient.  The  wave  will 
begin  to  bend  away  from  the  vertical.  As  the  wave  penetrates  farther  into  the  ionosphere,  it 
experiences  an  ever-increasing  superrefractive  effect.  Depending  on  its  initial  angle  of  incidence 
and  its  frequency  its  path  may  be  sufficiently  curved  to  be  bent  back  towards  the  surface.  It  is 
essentially  “reflected”  from  the  ionosphere  (even  though  reflection  has  no  real  part  in  the  process). 
If  it  fails  to  be  reflected  by  the  first  layer  it  will  continue  to  propagate  deeper  into  the  ionosphere 
encountering  higher  layers  with  denser  ionization  and  higher  superrefractivity.  Any  ray  with 
frequency  below  the  plasma  frequency  corresponding  to  the  peak  ionization  (a  frequency  we  will 
call  the  cut-off  frequency)  will  ultimately  be  refracted  back  towards  the  surface.  The  height  at 
which  it  is  so  “reflected”  will  be  a function  of  incidence  angle,  frequency,  and  ionospheric  state. 

If  we  consider  a fixed  ionosphere  and  select  a frequency  below  the  cutoff  frequency  and 
propagate  rays  toward  the  ionosphere  with  increasing  elevation  angle  we  observe  behavior  similar 
to  that  in  Figure  4-3 1 . Low  elevation  rays  (ray  5)  reflect  off  lower  portions  of  the  ionosphere  and 
continue  back  to  the  surface  striking  the  ground  at  great  distances  determined  by  the  altitude  of 
reflection  and  the  real  spherical  geometry.  The  maximum  distance  will  be  approximately  twice  the 
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“horizon  range”  as  viewed  from  the  effective  altitude  of  the  reflection  (the  height  of  the  transmitter 
has  little  effect  on  the  maximum  range  for  realistic  transmitter  heights).  Thus,  using  Eqs  (4.23)  and 
(4.24)  we  have 


“ 260.7 km  (77  in  km)  (4.86) 

or 

Rmax  ~ 191.7  nmi  (77  in  nmi) . (4.87) 

For  reflection  at  the  E layer  (roughly  90  km  high)  we  have  a maximum  range  per  bounce  of  about 
2500  km.  For  reflection  from  the  F2  layer  (perhaps  200  km  high)  we  have  a maximum  range  per 
bounce  of  about  3700  km. 


As  the  elevation  angle  of  the  ray  increases  (ray  4)  the  path  to  the  ground  decreases  steadily. 
However,  at  some  elevation  angle  (ray  3)  corresponding  to  a ground  bounce  distance  of  about  800 
km,  the  distance  stops  decreasing  with  increasing  elevation  angle.  Rays  with  higher  elevation  angles 
(rays  1 and  2)  will  pass  through  the  first  layer  and  encounter  regions  of  weaker  bending  that  cause 
them  to  travel  larger  horizontal  distances  before  ultimately  reflecting  off  of  a higher  layer.  Thus 
there  is  a minimum  bounce  distance  of  about  800  km.  The  presence  of  two  ray  paths  to  the  same 
point  on  the  ground  (e.g.,  ray  5 and  ray  1)  could  conceptually  lead  to  interference,  but  given  the  wide 
angular  separation  this  is  not  likely  to  be  a problem  for  sensor  performance.  Only  frequencies  near 
to  the  cutoff  frequency  would  not  likely  have  a minimum  bounce  distance. 

Frequencies  above  the  cutoff  frequency  can  be  reflected  as  well  but  only  at  lower  elevation 
angles.  The  higher  the  frequency  the  weaker  the  ionospheric  refraction.  Higher  frequency  rays 
traveling  predominantly  upward  would  not  be  sufficiently  refracted  before  they  reached  the  height 
of  maximum  ionization  at  which  point  the  medium  become  subrefractive  and  the  rays  can  escape. 
Those  at  lower  elevation  angles  need  only  smaller  changes  in  curvature  for  them  to  be  returned  to 
the  surface.  The  maximum  frequency  at  which  ionospheric  bounces  will  occur  is  in  the  range  of  30- 
50  MHz  (well  above  the  cutoff  frequency  of  roughly  10  MHz  for  normal  peak  ionization  levels). 
It  has  been  found  that  the  “effective  plasma  frequency”  for  rays  of  low  elevation  angle  is  given  by 


^peff  = 
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(4.88) 


where  a is  the  angle  of  the  ray  with  respect  to  the  zenith  (90°  minus  the  elevation  angle).  When  the 
real  frequency  equals  the  effective  plasma  frequency  the  angled  ray  will  be  reflected. 
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Figure  4-31.  Variation  in  ray  paths  as  a function  of  elevation  angle  in  ionospheric  reflec- 
tion. [17] 


IONO#HERE 


The  effective  height  of  reflection  depends  on  the  ionization  profde  as  well  as  angle  and 
frequency.  As  the  ionosphere  undergoes  it  diurnal  variation,  the  effective  height  of  reflection  and 
thus  the  effective  range  of  the  bounce  will  vary  up  and  down.  Systems  using  ionospheric  bounce 
channels  are  usually  designed  to  vary  the  operating  frequency  to  keep  the  bounce  characteristics 
relatively  independent  of  time  of  day. 

In  general  most  ionospheric  effects  are  refractive  in  character.  There  is,  however,  some 
potential  for  absorption.  The  absorption  in  a weakly  ionized  plasma  can  be  determined  from  the 
equation 


a = Us ^mc)(N^vJi^(27Vvf  + v" 


(4.89) 


where  is  the  electron  density  (in  electrons/m^)  and  is  the  collision  frequency  between  electrons 
and  neutrals  (in  Hz).  Evaluating  the  constants  yields 


a = 4.6x \Q-\N^v^)  (IttvY  + v 


.1-1 


1.16x10'^  Nvjv^ 


dB/  km 
dB/km 


(4.90) 


Only  in  the  lower  layer  of  the  ionosphere  (D  layer)  is  the  collision  frequency  high  enough  to  give 
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any  appreciable  absorption.  Typical  values  for  the  collision  frequency  in  the  60-90  km  altitude 
regime  are  10^  to  10®  per  second.  Given  electron  densities  of  10^  to  10'“  m'^  in  this  same  region,  we 
find  products  in  the  range  of  10'®  to  10'^  m'^-s  '.  For  radio  frequencies  in  the  range  of  3 to  10 
MHz,  one  can  readily  calculate  attenuations  ranging  from  an  inconsequential  0.1  dB/km  to  a 
substantial  12.9  dB/km.  Because  of  the  inverse  square  dependence  on  frequency,  absorption  will 
be  negligible  at  frequencies  significantly  greater  than  10  MHz.  Even  though  electron  densities  are 
much  higher  in  the  E and  F layers,  the  collision  frequency  is  many  of  magnitude  lower  (the 
frequency  scales  with  atmospheric  pressure  which  falls  off  roughly  one  order  of  magnitude  for  each 
10  km  increase  in  altitude  above  70  km),  preventing  any  substantial  absorption. 

A final  refractive  effect  can  occur  in  the  ionosphere.  On  small  scales  (kilometers)  the 
ionosphere  is  inhomogeneous.  There  are  currents  and  eddies,  just  as  there  are  currents  and  eddies 
in  the  atmosphere.  The  refractivity  variations  associated  with  these  eddies  produces  scintillation  at 
microwave  frequencies  just  as  atmospheric  turbulence  produces  scintillation  in  the  visible  and 
infrared.  Although  there  are  no  good  predictive  tools  for  ionospheric  scintillation,  there  are  a 
few  useful  observations.  Scintillation  appears  most  severe  in  early  evening  but  can  be  noticeable 
at  any  time.  It  is  more  severe  in  equatorial  regions  than  in  polar  regions.  Scintillation  is  only  seen 
at  night  in  polar  regions.  Scintillation  amplitude  decreases  with  increasing  frequency  but  is  still 
noticeable  at  gigahertz  frequencies.  Scintillation  fades  can  exceed  20  dB.  The  characteristic  time 
scale  of  fading  tends  to  be  relatively  slow  (tens  of  seconds).  As  with  turbulence,  additional  carrier- 
to-noise  ratio  margin  may  be  required  to  overcome  the  effects  of  scintillation  fades  on  detection 
probability. 
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Problems 


4-2. 

4-3. 

4-4. 


4-5. 

4-6. 

4-7. 

4-8. 

4-9. 

4-10. 


An  incoming  cruise  missile  has  a elosure  veloeity  of  1 .4  km/sec  and  an  altitude  of  2 m above 
mean  sea  level.  An  intereeptor  missile  is  to  be  launched  to  intercept  the  cruise  missile  at  5 
km  range.  The  fire  control  system  requires  12  seconds  to  detect  and  establish  track  on  the 
ineoming  missile.  Administrative  delays  involving  the  deeision  to  engage  require  at  least 
6 seconds.  Another  2 seeonds  is  required  to  prepare  and  aetually  launeh  the  intereeptor.  The 
intereeptor  requires  5 seeonds  to  fly  out  to  the  intereept  range.  What  is  the  minimum 
acceptable  height  of  the  search  radar  in  order  to  suceessfully  aeeomplish  this  intercept? 

What  is  ducting?  What  condition  causes  it?  Note:  answer  desired  is  not  specifie  to  any  type 
of  radiation  or  wavelength  region. 


What  type  of  duct  always  occurs  over  bodies  of  water?  At  what  heights  does  the  top  of  these 
ducts  typically  occur? 


Given  the  following  eurve  of  modified  refractivity 
vs.  height  identify  the  altitude  regions  having 
ducting.  What  kinds  of  ducts  are  present? 


Deseribe  the  kinds  of  atmospheric  ducts  that  might  occur  over  land  and  outline  the  effeets 
they  might  have  on  mierowave  sensors. 

What  is  the  dominant  souree  of  atmospheric  turbulence?  Describe  briefly  how  it  produces 
turbulence. 

Quantify  the  two  primary  differenees  between  nighttime  and  daytime  turbulence. 

Name  three  major  effeets  of  turbulence  on  sensor  performance. 

What  three  parameters  determine  the  magnitude  of  all  turbulenee  effeets? 

Whieh  of  the  following  are  prominent  refraetion  effects? 

Mirages?  dueting?  beam  wander?  extinetion?  scintillation?  beam  spread? 
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4-11.  What  turbulence  effects  are  quantified  by: 

- the  log-amplitude  variance? 

- the  atmospheric  correlation  length? 

4-12.  For  HF  radiation  with  a frequency  of  2 MHz  and  using  the  data  of  F igure  4-29,  estimate  the 
heights  at  which  reflection  occurs  for  radiation  initially  radiated  at  45°  to  the  zenith. 
Calculate  results  for  noon  and  midnight  and  solar  maximum  and  solar  minimum. 

4-13.  Using  the  heights  calculated  in  the  preceding  problem,  estimate  the  maximum  ranges  that 
can  be  obtained  on  a single  bounce  or  skip. 


157 


158 


CHAPTER  5 


PROPAGATION  OF  ELECTROMAGNETIC  RADIATION.  IV. 
- OTHER  ATMOSPHERIC  AND  UNDERWATER  EFFECTS 


Contrast  Transmission 


The  observable  quantity  used  by  most  visible  and  infrared  imaging  sensors  is  the  target- 
to-background  contrast.  Contrast  can  be  defined  in  several  different  ways:[l] 


1 

^ B 

(5.1a) 

C=  , 

It  + 1 ^ 

(5.1b) 

or 

/ 

1 j 

(5.1c) 

where  Ij  is  the  signal  strength  observed  from  the  target  and  1^  is  the  signal  strength  observed  from 
the  nearby  background.  In  the  remainder  of  this  work  we  will  use  Equation  (5.1a)  unless  the 
conventions  of  a specific  application  requires  otherwise.  Exceptions  will  be  explicitly  identified. 

The  inherent  contrast  of  an  object  is  defined  as  the  contrast  of  that  object  at  zero  range  and 
is  given  by  [1],[2] 

' ' ^.(0) 

(5.2) 

The  signals  from  both  target  and  background  will  be  attenuated  by  the  intervening  atmosphere. 
They  will  also  be  augmented  by  a path  radiance  Ip{R)  that  include  atmospheric  emission  and 
radiation  scattered  into  the  line  of  sight  by  haze,  fog,  smoke,  or  other  obscurants.  Thus, 

(5.3) 

and 

(5.4) 
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Using  these  expressions  the  apparent  contrast  observed  at  a distance  R becomes 


rln\  ir(o)e-^-h(oy^ 

' IsW  ■ iAo)e-'^+iAR) 
iAo)+iARy" 

(5.5) 

7,(0)- /,(0)  1 

/„(0)  l + [/,(77)//,(0)]e+“" 

c(o) 

l + [/,(7?)/7,(0)>-^“" 

The  ratio  of  the  apparent  contrast  to  the  inherent  contrast  is  known  as  the  contrast  transmittance 

C{R)  1 

C(0)‘l  + [7,(/?)/7„(0)>"‘^'  * ’ 

If  the  attenuation  is  very  large,  or  if  the  path  radiance  is  very  bright  (as  might  be  caused  by  a highly 
scattering  smoke  cloud  between  the  observer  and  the  target),  or  both,  then  Eq.  (5.6)  can  be 
simplified  to 


= ^.(0) 
C(0)  7,(7?) 


(5.7) 


We  will  use  this  expression  in  the  next  section  to  evaluate  the  performance  of  artificial  obscurants. 

The  path  radiance  becomes  larger  as  the  path  increases  up  to  a limit.  At  some  point  the 
attenuation  of  the  atmosphere  will  start  degrading  the  path  radiance  as  fast  as  that  radiance  grows, 
and  a limiting  value  called  the  path  radiance  limit  4.  The  path  radiance  and  the  path  radiance  limit 
are  related  by  the  expression 

7,(7?)  = 7,(l-e-'^).  (5.8) 


160 


Substituting  for  the  path  radiance  in  Eq.  (5.6)  yields 


C{R)  1 

c(0)  "l  + [/^//,(0)][e+“^-l] 

where  the  quantity  IJI^  is  called  the  sky-to-ground  ratio.  Typical  values  of  I Jig  are  in  the  range 
of  2 to  5 [3]  (although  as  indicated  in  Table  5-1  the  values  can  be  very  widespread  under  extreme 

conditions  [2]).  Now  consider  the  case  where  the  total  attenuation  is  substantial.  Then  » 1 
and  Equation  (5.9)  can  be  simplified  to 

C(R) 

— W-  = T (5-10) 

c(o)  [/,//,(0)] 


— ccR 

that  is,  the  contrast  transmittance  is  proportional  to  the  actual  transmittance  ( e ). 

Now  consider  the  case  where  the  target  is  a black  target  so  that  C(0)  is  -1  independent  of  the 
brightness  of  the  background  and  now  assume  that  the  target  is  viewed  against  the  sky  background. 
This  is  a fairly  accurate  method  of  estimating  meteorological  visibility.  Given  our  assumptions,  we 
have  Ig  = 7g(0)  and 


^ -aR 

c(0)- 


(5.11) 


Now  if  we  assume  a 2%  contrast  is  required  for  detection  and  that  the  range  at  which  detection  is 
barely  possible  is  called  the  visibility  V (refer  back  to  Chapter  3).  Then  we  may  solve  Eq.  (5.11)  to 
obtain  Koschmieder’s  formula 


c(R) 

c(o) 


= 0.02  = 


3.912 
a = 

y 


(5.12) 


Table  5-1.  Typical  sky-to-ground  ratios.  [2] 


SKY  CONDITION 

GROUND  CONDITION 

SKY-TO-GROUND  RATIO 

CLEAR 

FRESH  SNOW 

0.2 

CLEAR 

DESERT 

1.4 

CLEAR 

FOREST 

5 

OVERCAST 

FRESH  SNOW 

1 

OVERCAST 

DESERT 

7 

OVERCAST 

FOREST 

25 

161 


Smoke,  Dust,  and  Battlefield  Obscurants 

Obscurants  are  aerosols  other  than  fog  or  haze,  that  degrade  visibility  to  unacceptably  low 
levels.  In  the  form  of  smoke  from  fires  and/or  weapon  discharges,  and  dust  from  explosions  and/or 
vehicular  traffic,  obscurants  have  contributed  to  the  “fog  of  war”  ever  since  the  beginnings  of 
organized  warfare.  In  the  twentieth  century,  the  development  of  artificial  obscurants,  the  so-called 
“military  smokes”,  has  made  it  possible  to  affect  and  control  the  visibility  almost  anywhere  in  the 
battlespace. 

Obscurants  can  affect  sensor  performance  in  four  ways:  absorption,  scattering,  turbulence, 
and  cloud  radiance.  Some  obscurants  absorb  electromagnetic  radiation.  The  absorption  reduces 
the  signal  strength  available  for  detection  or  illumination.  Since  obscurants  can  never  be  perfectly 
uniformly  distributed  in  space  (they  are  generated  at  discrete  points),  the  inhomogeneous  absorption 
will  lead  to  low  frequency  spatial  and  temporal  modulation  of  the  measured  signal.  All  obscurants 
will  scatter  electromagnetic  radiation.  Just  like  absorption,  scattering  will  reduce  the  signal  strength 
available  for  detection  or  illumination,  and  can  lead  to  low  frequency  spatial  and  temporal 
modulation  of  signals.  The  scattering  also  leads  to  spreading  of  electromagnetic  beams  and 
diffusion  of  image  data.  It  can  lead  to  temporal  dispersion  (lengthening)  of  short  electromagnetic 
pulses  (although  this  is  usually  an  insignificant  effect).  Scattering  can  also  lead  to  depolarization 
of  electromagnetic  radiation.  Many  obscurant  production  systems  involve  heat  sources,  such  as 
pyrotechnic  smoke  canisters,  spraying  oil  on  hot  engine  exhaust  manifolds,  or  incendiary  grenades. 
As  a consequence  of  the  non-uniform  introduction  of  thermal  energy  into  the  atmosphere, 
turbulence  may  be  enhanced.  As  will  be  discussed  in  more  detail  later  in  this  chapter,  turbulence 
can  cause  random  steering  and  spreading  of  electromagnetic  beams,  introduction  of  amplitude 
fluctuations  (scintillation)  into  signals,  and  distortion  and/or  diffusion  of  images.  The  heat  retained 
by  the  obscurant  cloud  will  result  in  a certain  amount  of  cloud  radiance.  Usually  this  radiance  will 
be  in  the  infrared,  although  some  smokes  (e.g.,  burning  phosphorus)  may  emit  significant  amounts 
of  visible  radiation  for  a brief  period  of  time.  An  additional  radiance  contribution  may  result  from 
the  scattering  of  ambient  radiation  (sunlight  or  earthshine)  into  a sensor’s  line  of  sight.  The  next 
effects  of  cloud  radiance  are  reduced  contrast,  increased  shot  noise,  and  possible  low  frequency 
spatial  and  temporal  modulation  of  background  radiance. 

Smoke  and  dust  will  be  produced  with  some  degree  of  randomness  on  the  battlefield. 
Artificial  military  aerosols  (battlefield  “smokes”)  are  dispensed  with  the  intent  to  establish  specific 
useful  conditions.  Their  use  is  neither  random  nor  unpredictable.  Screening  smoke  has  four 
primary  modes  of  application.  A smoke  blanket  is  a dense  smoke  concentration  established  over 
a friendly  area  to  prevent  enemy  observation  and  precision  aerial  bombing  [4].  It  has  the  drawback 
of  significantly  restricting  friendly  activity  and  movement  within  the  blanket.  The  attenuation  along 
any  path  to  the  exterior  of  the  smoke  blanket  (including  straight  up)  must  exceed  17  dB  (remember 
the  definition  of  visibility  in  Chapter  3).  This  will  require  the  visibility  inside  the  smoke  blanket  to 
be  of  the  order  of  50-100  m,  equivalent  to  a thick  fog.  The  self-protection  smoke  produced  by  the 
smoke  grenades  on  armored  vehicles  is  a limited  application  of  a smoke  blanket.  A smoke  haze  is 
a light  smoke  concentration  established  over  a friendly  area  to  reduce  enemy  visual  observation  by 
means  other  than  direct  overflight.  Resulting  visibility  is  typically  135-180  m,  which  is  sufficient 
to  permit  near  normal  friendly  movement  and  activity.  A smoke  curtain  is  a vertical  screen 
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established  between  friendly  and  enemy  forces  to  prevent  any  direct  observation  of  the  activities  of 
either  side.  Minimum  useful  one-way  attenuation  for  a smoke  curtain  is  17  dB.  Obscuring  smoke 
is  a dense  smoke  concentration  established  over  enemy  positions  to  restrict  enemy  observation  and 
operations.  Resulting  visibility  should  be  less  than  or  equal  to  50  m (equivalent  to  a dense  fog), 
which  is  sufficient  to  slow  vehicular  traffic  to  a crawl,  prevent  visual  coordination  between  adjacent 
units,  and  make  it  difficult  to  correlate  position  against  map  references. 

The  transmission  of  smoke,  dust,  or  military  aerosol  clouds  is  given  by  the  expression 


T = 


(5.13) 


where  a is  the  mass  extinction  coefficient  (with  units  of  mVg),  C is  the  aerosol  concentration  (in 
g/m^),  and  L is  the  optical  path  length  through  the  aerosol  cloud.  For  simplicity  of  use,  aerosol 
clouds  are  often  characterized  by  a path-averaged  concentration-path  length  product,  CL,  in  which 
case  the  transmission  is  given  by 


An  orCL  product  of  4 yields  7=  0.075  or  17.4  dB  attenuation.  Thus,  aCL=4  provides  an  attenuation 
comparable  to  that  used  to  define  visibility.  As  a result  this  value  can  be  used  to  define  the 
attenuation  of  an  effective  smoke  curtain. 

Table  5-2  summarizes  data  on  a number  of  common  battlefield  aerosols.  [1], [5], [6]  Included 
in  the  table  are  military  nomenclature  if  applicable,  the  generic  composition  of  the  aerosols,  typical 
aerosol  size,  and  mass  extinction  rations  at  a number  of  wavelengths.  The  unfortunate  lack  of  open 
source  data  leads  to  blanks  in  the  columns  of  a few  aerosol  types.  Table  5-3  summarizes  the  effects 
of  relative  humidity  on  the  obscuration  characteristics  of  two  of  the  most  important  artillery 
delivered  smokes  - WP  (white  phosphorus)  and  HC  (Zn/Hexachloroethane).[6]  In  general  higher 
relative  humidity  leads  to  more  extraction  of  water  vapor  from  the  air,  with  a higher  yield  ratio 
(mass  of  smoke  particles  per  mass  of  initial  smoke  compound)  and  larger  particle  sizes.  The  latter 
favors  increased  infrared  attenuation,  but  also  favors  faster  settling  of  the  smoke  particles  out  of 
suspension  in  the  air.  Table  5-4  summarizes  the  obscuration  characteristics  of  various  kinds  of  dust 
and  blowing  snow,  two  natural  obscurants  that  are  common  generated  by  the  action  of  weapons  on 
the  soil  or  by  the  movement  of  vehicles  or  helicopters. [6] 

Useful  insights  can  be  obtained  by  comparing  the  attenuation  performance  of  several 
common  military  smokes.  Figure  5-1  compares  the  attenuation  versus  concentration-path  length 
product  for  FS,  HC,  WP,  and  two  kinds  of  fog  oil  (A  - explosively  dispersed  and  B - generated  by 
dripping  oil  onto  a hot  plate).  The  attenuations  in  the  visible  (0.63  pm)  are  compared  with  those  in 
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Table  5-2.  Extinction  data  on  a variety  of  battlefield  aerosols. [1], [5], [6] 


TYPE 

NOMEN- 

CLATURE 

APPLICABLE 

MIL-SPEC 

AEROSOL 

COMPOSITION 

MEDIAN 

AEROSOL 

niiUFTFR 

MASS  EXTINCTION  COEFFICI 

5 

ENT  (m  /9) 

74 

WAVELENGTH  (um)  I 

(um) 

0.4- 

0,7 

0.63 

0.7- 

1.2 

0.9 

1.06 

2.06 

3-6 

6-12 

10.6 

PHOSPHOROUS 

RP 

MIL-P-211 

H-PO  .r»H  0 

1.1 

. 

1.8 

_ 

1.6 

. 

. 

. 

0.33 

WP 

MIL-W-216 

- 

2.98 

- 

2.46 

2.26 

- 

0.48 

0.27 

0.26 

PWP 

MIL-C-00337 

HEXACHLOROETHANE/ 

HC 

MIL-H-236 

ZnCL*nH  O 

1.3 

- 

1.4 

1.6 

_ 

. 

. 

0.12 

ZnO/AI 

- 

1.81 

- 

1.98 

1.93 

- 

0.31 

0.07 

0.09 

SULFUR  TRIOXIDE/ 

FS 

MIL-C-379 

H SO  *nH  0 

0.66 

- 

4.6 

. 

3.1 

2.6 

0.4 

. 

0.20 

CHL0R08ULF0NIC  ACID 

OBSOLETE 

2 4 2 

TITANIUM 

FM 

MIL-C-367 

TI(OH)  4HCI 

. 

. 

. 

_ 

. 

_ 

_ 

_ 

TETRACHLORIDE 

•nH^O^ 

FOG  OIL  - TYPE  8QF-2 

FOG  OIL 

MIL-F-12070 

OIL  DROPLETS 

- 

6.86 

_ 

4.69 

3.48 

_ 

0.26 

0.02 

0.02 

- PYROTECHNIC 

0.6 

- 

4.2 

- 

4.6 

3.1 

1.1 

- 

- 

0.03 

SOURCE 

- HOT  PLATE  SOURCE 

3.4 

- 

2.9 

- 

2.6 

2.2 

1.4 

- 

0.20 

DIESEL  OIL 

7 

- 

OIL  DROPLETS 

- 

6.40 

- 

3.69 

2.94 

- 

1.34 

o 

o 

1.00 

ANTHRACENE 

7 

7 

CRYSTALS 

- 

6.00 

- 

3.60 

2.00 

- 

0.23 

0.06 

0.06 

YER8HOV  COMPOSITION 

7 

- 

NH.ChnH^O 
4 2 

- 

- 

- 

- 

- 

- 

- 

- 

- 

CARBON 

- SOOT 

- 

- 

PARTICLES 

- 

1.60 

- 

1 .46 

* 

1.42 

- 

0.76 

0.32 

0.30 

- GRAPHITE 

7 

7 

FLAKES 

- 

- 

- 

- 

- 

- 

- 

- 

- 

- 

BURNING  DIESEL  FUEL 

- 

- 

SOOT  4 

. 

6.40 

. 

3.69 

. 

2.94 

_ 

1.34 

1.00 

1.00 

HYDROCARBONS 

the  infrared  (10.6  (j,m).  Recall  that  an  attenuation  of  17  dB  {aCL  = 4)  defines  an  effective  smoke 
“thickness”  in  the  visible.  If  a marginally  visually  effective  smoke  cloud  is  produced  for  any  of  these 
smokes,  then  it  is  readily  apparent  that  the  attenuation  in  the  infrared  will  be  negligible  (approxi- 
mately 1-2  dB).  Because  the  mass  extinction  coefficients  are  much  smaller,  effective  obscuration 
in  the  infrared  requires  CL  products  an  order  of  magnitude  larger  than  necessary  to  produce  effective 
obscuration  in  the  visible. 

Many  military  obscurants  have  much  larger  scattering  coefficients  than  absorption 
coefficients.  These  materials  produce  “white”  smokes.  However,  some  obscurants  are  dominated 
by  absorption  rather  than  scattering.  These  materials  produce  what  are  known  as  “black”  smokes. 
Their  effects  are  somewhat  different.  Consider  the  contrast  transmittance 


C(/?)  7.(0) 

C(0)  /,(R) 


(5.15) 


If  the  path  radiance  is  due  primarily  to  a smoke  cloud  between  the  observer  and  the  target,  we  may 
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Table  5-3.  Relative  humidity  effects  on  the  extinction  of  WP  and  HC  aerosols. [6] 


WP  SMOKE 


RELATIVE 

HUMIDITY 

% 

YIELD 
FACTOR 
kg  smoke/ 
kg  WP 

MASS  EXTINCTION  COEFFICIENT  (m^/g) 

WWELENGTH  (urn) 

0.4-0. 7 

0.7-1. 2 

1.06 

3-5 

8-12 

10.6 

5 

2.66 

1,50 

1.34 

0.26 

0.38 

0.35 

10 

2.94 

1.51 

1.30 

0.29 

0.38 

0.36 

30 

3.91 

3.76 

1.60 

1.26 

0.33 

0.38 

0.39 

50 

4.34 

4.08 

1.77 

1.37 

0.29 

0.38 

0.38 

70 

5.10 

3.90 

2.03 

1.66 

0.29 

0.36 

0.35 

90 

7.85 

3.23 

2.36 

2.1  1 

0.41 

0.30 

0.28 

95 

1 1.70 

2.98 

2.46 

2.25 

0.48 

0.27 

0.26 

HC  SMOKE 


RELATIVE 

HUMIDITY 

% 

YIELD 
FACTOR 
kg  smoke/ 
kg  HC 

MASS  EXTINCTION  COEFFICIENT  (m^/g) 

WAVELENGTH  (um) 

0.4-0. 7 

0.7-1. 2 

1.06 

3-5 

8-12 

10.6 

5 

1.39 

2.76 

1.67 

1.40 

0.17 

0.01 

10 

1.46 

3.00 

1.87 

1.56 

0.19 

WSSM 

30 

1.59 

3.60 

2.44 

2.04 

0.21 

■H 

0.03 

50 

1.89 

3.66 

2.67 

2.28 

0.19 

0.03 

0.04 

70 

2.40 

3.18 

2.57 

2.28 

0.19 

0.03 

0.05 

90 

5.72 

2.15 

2.14 

2.03 

0,27 

0.06 

0.08 

95 

10.49 

1.81 

1.98 

1.93 

0.31 

0.07 

0.09 

Table  5-4.  Mass  extinction  coefficient  of  lofted  dust  and  snow  particles.  [6] 


MASS  EXTINCTION  COEFFICIENT  (m^/g) 

OBSCURANT 

YIELD  FACTOR 

WAVELENGTH  (um) 

0.4-0. 7 

0.7-1. 2 

1.06 

3-6 

8-12 

10.6 

HE  DUST,  SMALL 
SMALL 

— 

0.32 

0.29 

0.26 

0.27 

0.27 

0.24 

HE  DUST,  LARGE 
LARGE 

— 

0.04 

0.04 

0.04 

0.04 

0.04 

0.04 

VEHICULAR  DUST 
DRY  SOIL, 

65%  SILT 

2.3(APC),  4.4(TANK) 
kg  per  mph  per  mile 

0.32 

0.30 

0.29 

0.27 

0.25 

0.25 

HELO-LOFTED  DUST 
DRY  SOIL 

I. 6-5.5  g/m3  HOVER 

II. 9  g/m®  TAKEOFF 

0.32 

0.30 

0.29 

0.27 

0.25 

0.25 

LOFTED  SNOW 
DRY  SNOW 

0.2  - 2.8  g/m3 
1.5  g/m3  AVERAGE 

0.03 

0.03 

0.03 

0.04 

0.05 

0.05 
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Figure  5-1.  Comparison  of  visible  and  infrared  attenuation  produced  by  military  smokes. 


estimate  the  path  radiance  given  knowledge  of  the  external  illumination.  We  will  represent  the  total 
attenuation  coefficient  a by  the  sum  of  the  absorption  coefficient  and  the  scattering  coefficient 
a^.  If  is  the  external  illumination  due  to  sunlight,  moonlight,  skyshine  (scattered  sunlight), 
earthshine  (scattered  sunlight  and  thermal  emission),  or  other  radiation  source,  then  the  fraction  of 
that  illumination  scattered  per  unit  length  of  obscurant  is  given  by 

dig  - -dig  - IgUgdx  (5.16) 

The  obscurant  nominally  scatters  its  radiation  into  4n  sr.  There  is  an  angular  dependence  to  the 
scattering  that  should  be  taken  into  account,  however,  we  will  simply  denote  this  dependence  by  a 
generic  angle  dependent  efficiency /(^/4ti:.  If  we  then  sum  all  of  the  scattered  contributions,  we 
obtain  the  overall  scattered  signal  as 

Io  = Is{l-e~'^’‘-)f(d)/4!r  . (5.17) 

This  term  can  be  substituted  into  Eq.  (5.15)  in  place  of  the  path  radiance. 
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(5.18) 


c{R)  is{o) 

C(0)  I,{l-e-“’^)f{e)/4;r 

Note  that  if  the  scattering  coefficient  is  small,  the  obscurant  radiance  will  be  small.  Since  this  is  the 
effective  path  radiance,  then  the  contrast  transmittance  is  increased.  Black  smokes  require  larger 
overall  attenuations  to  produce  a desired  reduction  in  contrast.  The  magnitude  of  the  effect  also 
depends  on  the  strength  of  the  external  illumination.  The  subject  of  white  and  black  smokes  could 
be  discussed  in  considerably  more  detail.  However,  such  discussion  is  beyond  the  scope  of  this 
work. 


Figure  5-2  shows  the  effects  of  “gray”  smokes  on  detection  of  a target  by  a small  television 
sensor.  The  detection  range  is  plotted  versus  aCL  (ACL  in  the  figure)  for  two  different  external 
illumination  levels  (daylight  - 10000  ft-L  and  dusk  - 10  ft-L).  Hypothetical  smokes  with  different 
scattering  ratios  rj 

ri-a^la  (5.19) 

ranging  from  0 (pure  absorption)  to  0.9  (strongly  scattering).  Examination  of  the  figure  clearly 
indicates  that  considerably  larger  aCL  values  are  needed  to  impact  detection  range  when  “black” 
smokes  are  employed  than  when  “white”  smokes  are  employed.  At  low  light  levels,  then  any 
additional  reduction  in  contrast  is  magnified,  although  the  “white  smoke  is  better  than  black  smoke” 
comparison  still  holds. 


Figure  5-2.  Comparison  of  white/black  smokes  effects  on  television  detection  performance. 
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Pasquill  Stability  and  Atmospheric  Diffusion 

The  effectiveness  of  employment  of  military  obscurants  or  chemical  and  biological  warfare 
agents  depends  on  many  atmospheric  factors.  One  of  the  most  important  of  these  is  the  Pasquill 
stability.  The  Pasquill  stability  conditions  relate  to  the  rate  of  vertical  diffusion  of  particles.  The 
more  stable  the  atmosphere,  the  less  vertical  transport  of  particulates  occurs.  In  addition,  the 
horizontal  diffusion  of  particles  (absent  the  effects  of  variable  wind  directions)  tends  to  correlate 
well  with  the  vertical  diffusion.  The  more  stable  the  atmosphere,  the  less  horizontal  diffusion  of 
particulates  occurs.  Pasquill  [7], [8]  identified  six  levels  of  stability  as  shown  in  Table  5-5.  A 
seventh  category  (G  or  7 - Extremely  Stable)  was  added  later. 


Table  5-5.  Pasquill  Stability  Conditions  [7] -[9] 

CONDITION  PASQUILL  DESIGNATION  ALTERNATE 

DESIGNATION 

EXTREMELY  UNSTABLE 

A 

1 

UNSTABLE 

B 

2 

SLIGHTLY  UNSTABLE 

C 

3 

NEUTRAL 

D 

4 

SLIGHTLY  STABLE 

E 

5 

STABLE 

F 

6 

EXTREMELY  STABLE 

G 

7 

Turner  [9]  further  developed  a correlation  between  wind  speed,  cloud  cover,  and  insolation 
(amount  of  incident  solar  radiation)  and  the  stability  condition.  Specifically  he  defined  a net 
radiation  index  according  to  the  following  criteria: 

1)  Net  Radiation  Index  = 0 if  cloud  cover  is  10/10  and  ceiling  is  less  than  7000  feet  whether 
day  or  night; 

2)  Nighttime  (between  sunset  and  sunrise)  - a)  Net  Radiation  Index  = -2,  if  cloud  cover  is 
less  than  or  equal  to  4/10  and  b)  Net  Radiation  Index  = -1 , if  cloud  cover  is  greater  than  4/10. 

3)  Daytime  - a)  Determine  Insolation  Class  from  Table  5-6; 

b)  Net  Radiation  Index  = Insolation  Class,  if  cloud  cover  is  less  than  or  equal 
to  5/10; 

c)  Net  Radiation  Index  = Adjusted  Insolation  Class,  if  cloud  cover  is  greater 
than  5/10. 

Adjust  the  Insolation  Class  according  to: 

i)  Subtract  2 if  ceiling  is  less  than  7000  feet; 

ii)  Subtract  1 if  ceiling  is  between  7000  feet  and  16,000  feet; 

iii)  Subtract  1 if  total  cloud  cover  is  10/10  and  ceiling  is  above  7000  feet; 

iv)  If  Insolation  Class  has  not  been  modified  by  above,  then  assume  the  Adjusted 
Insolation  Class  is  the  same  as  the  Insolation  Class; 

v)  If  the  Adjusted  Insolation  Class  is  less  than  1,  set  it  equal  to  1. 

Given  the  Net  Radiation  Index  and  the  Wind  Speed,  the  Pasquill  Stability  class  is  determined  from 
Table  5-7. 
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Table  5-6.  Insolation  Class  Number.  [9] 


SOLAR  ALTITUDE  - A 

INSOLATION 

INSOLATION  CLASS  # 

A > 60° 

STRONG 

4 

35°  < A < 60° 

MODERATE 

3 

15°  < A<35° 

SLIGHT 

2 

A<  15° 

WEAK 

1 

Table  5-7.  Stability  Class  as  a function  of  net  radiation  index  and  wind  speed.  [9] 


WIND  SPEED 
(knots) 

0-2 

2-4 

4-6 

6- 7 

7- 8 

8- 10 
10-1 1 
11-12 

>12 


NET  RADIATION  INDEX 


4 3 2 

1 1 2 

1 2 2 

1 2 3 

2 2 3 

2 2 3 

2 3 3 

3 3 4 

3 3 4 

3 4 4 


1 Q_  ^ 

3 4 6 

3 4 6 

4 4 5 

4 4 5 

4 4 4 

4 4 4 

4 4 4 

4 4 4 

4 4 4 


7 

7 

6 

6 

5 

5 

5 

4 

4 


The  Stability  Class  correlates  with  the  rate  of  vertical  diffusion  of  particulates.  Figure  5-3 
illustrates  the  behavior  of  a smoke  plume  for  nominal  stable  (class  F(6)),  neutral  (class  D(4)),  and 
unstable  (class  B(2))  atmospheres.  The  stable  plume  reaches  a stable  vertical  thickness  (determined 
by  the  height  of  the  inversion  layer)  and  then  does  not  diffuse  vertically  any  farther.  The  neutral 
plume  diffuses  slowly  but  smoothly  to  ever-increasing  thicknesses.  The  particulate  concentration 
remains  highest  near  the  surface.  The  unstable  plume  diffuses  rapidly  and  may  actually  lift  from  the 
surface.  The  influence  on  performance  should  be  obvious. 

Pasquill  also  estimated  the  amount  of  diffusion  that  could  be  expected  for  each  stability  class. 
Consider  a continuous  emitter  of  material  (such  as  a smokestack,  a smoke  generator,  or  a venting 
gas  tank)  located  at  ground  level  and  releasing  Q kg/s  of  material.  At  a distance  d from  the  source, 
the  plume  will  exhibit  a roughly  Gaussian  concentration  profile  in  the  vertical  direction  (see  Figure 
5-4)  given  by 


C(z)~C{0)e  (5.20) 

where  is  a scale  height  which  we  have  defined  to  be  the  1/e^  height  (Pasquill  actually  used  the  10% 
height  but  any  errors  introduced  by  this  difference  will  be  small)  and  may  be  found  from  Figure  5-5 
as  a function  of  stability  class  and  downwind  distance  d. 
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AVERAGE  WIND  DIRECTION  ■ 


Figure  5-5.  Angular  diffusion  rates  for  different  Pasquill  stability  categories. 


Note  the  curve  for  class  D splits  into  two  curves.  Curve  (1)  should  be  used  up  to  the  top  of 
the  dry  adiabatic  layer.  Above  this  layer,  in  sub-adiabatic  conditions,  a curve  parallel  to  curve  (2) 
with  value  at  the  top  of  the  dry  adiabatic  layer  equal  to  the  value  of  curve  (1)  at  the  top  of  the  dry 
adiabatic  layer  should  be  used. 

Diffusion  in  the  horizontal  direction  is  different  than  diffusion  in  the  vertical  dimension.  The 
transverse  diffusion  has  two  components.  One  will  be  a truly  random  diffusion  that  is  similar  (but 
not  equal  to)  the  diffusion  in  the  vertical  direction.  Of  greater  significance,  however,  is  term  caused 
by  time  variations  in  wind  direction.  As  the  plume  moves  downwind,  the  wind  will  fluctuate  in 
direction  about  the  long-term  average  wind.  The  spatial  scale  of  these  fluctuations  is  relative  large 
although  the  temporal  scale  may  be  as  short  as  seconds.  Large  regions  of  the  plume  will  temporarily 
move  in  one  direction  and  then  another.  Figure  5-4  shows  a “snapshot”  of  how  the  horizontal 
diffusion  might  look  at  one  instant  in  time. 
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If  one  measures  the  concentration  as  a function  of  time  and  averages  over  a relative  long 
time,  the  concentration  as  a function  of  downwind  distance  x = d and  transverse  distance  y from  the 
long-term  average  wind  vector  (with  origin  assumed  to  be  at  the  source)  will  also  be  Gaussian 

C(y) « (5.21) 

where  ^is  the  total  angular  width  of  the  plume  (defined  to  be  the  full  angle  at  1/e^  concentration 
points)  and  has  been  given  in  Figure  5-5  for  short  and  long  ranges. 

Combining  the  horizontal  and  vertical  diffusion  terms  and  normalizing  the  distributions  to 
equal  the  source  emission  rate  yields 


C(d,y,z) 
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(5.22) 


where  C has  units  of  kg/m^  if  Q has  units  of  kg/s,  h and  d have  units  of  meters,  m is  the  average  wind 
velocity  in  m/s,  and  6>is  in  radians. 

If  the  source  is  elevated  more  than  a few  meters  above  the  ground,  it  is  necessary  to  account 
for  downward  diffusion  as  well  as  upward  diffusion.  The  two  rates  may  be  assumed  to  be  equal. 
Convection  will  cause  the  upward  rate  to  be  slightly  higher,  but  particulate  settling  will  tend  to 
increase  the  downward  rate.  Except  in  extremely  unstable  conditions  or  if  particulates  are  very 
large,  neither  effect  should  dominate  the  random  isotropic  diffusion  caused  by  atmospheric 
turbulence.  Assuming  the  downward  rate  is  equal  to  the  upward  allows  the  vertical  concentration 
equation  to  be  modified  by  shifting  the  mean  and  adding  a term  which  describes  “reflection”  of  the 
diffusion  from  the  ground.  If  the  source  is  at  height  H then  we  have 
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as  the  governing  concentration  equation. 

The  results  in  Eqs.  (5.22)  and  (5.23)  are  long-term  average  values.  Over  the  long  term,  the 
fluctuations  in  wind  speed  and  direction  producing  the  sinuous  behavior  of  Figure  5-4  will  average 
out.  At  some  points  in  space  and  time,  the  equations  will  overestimate  the  actual  concentrations. 
At  other  points,  they  will  underestimate  the  concentrations.  But  over  the  course  of  many  minutes, 
the  average  values  will  prevail. 

If  local  real-time  meteorology  data  is  available  giving  actual  wind  speed  and  direction  traces, 
then  better  concentration  estimates  can  be  obtained.  Even  better  results  are  obtained  if  the  point 
meteorological  data  is  used  as  inputs  to  a micrometeorological  prediction  code  that  also  uses  real 
terrain  data  to  refine  the  spatial  variations  in  wind  speed  and  direction.  More  detailed  discussion 
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of  these  codes  is  beyond  the  scope  of  the  present  work. 

The  preceding  discussion  was  pertinent  to  a continuous,  slow  material  release  from  a point 
source.  Releases  that  are  localized  in  space  and  time  behave  slightly  differently.  Consider  the 
explosive  release  of  a quantity  of  material.  This  might  result  from  the  explosion  of  an  artillery  shell 
filled  with  smoke  agent.  The  explosion  rapidly  produces  a cloud  of  material  which  will  have 
substantial  but  limited  dimensions.  An  artillery  shell  might  produce  a roughly  spherical  cloud  50 
meters  in  radius.  This  puff  as  it  is  sometimes  called  will  then  diffuse  and  drift  with  the  wind.  Its 
small  size  means  that  it  will  not  develop  the  sinuous  character  shown  in  Figure  5-4.  Diffusion  will 
be  dominated  more  by  turbulence  than  by  wind  motions.  A reasonable  assumption  is  that  the 
horizontal  diffusion  (both  transverse  to  the  wind  and  along  the  wind)  of  the  puff  will  be  comparable 
to  the  vertical  diffusion.  If  the  puff  consisted  of  a release  of  M kg  of  material  at  ground  level  into 
a hemispherical  cloud  initially  characterized  by  radius  R,  then  at  a distance  d we  would  have 
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where  uT  is  the  distance  the  cloud  would  drift  in  time  T (the  assumed  time  of  observation)  and  h 
would  be  determined  from  Figure  5-5. 

Eq.  (5.24)  gives  the  concentration  of  the  puff  about  the  center  of  the  puff  as  it  moves. 
However,  the  puff  itself  will  follow  a sinuous  course  similar  to  that  shown  in  Figure  5-4.  The  puff 
will  pass  over  some  points  and  not  pass  over  others.  Comparing  the  concentration  calculated  from 
Eq.  (5.24)  with  the  concentration  associated  with  the  long  term  average  values  for  the  continuous- 
release  source  gives  an  estimate  of  the  probability  that  a single  puff  will  pass  over  a given  point. 

An  expression  related  to  Eq.  (5.24)  could  be  generated  for  a puff  released  at  a height  H above 
the  ground.  This  is  left  as  an  exercise  for  the  reader.  Other  source  geometries  can  be  considered 
by  recognizing  that  any  complex  cloud  geometry  can  be  decomposed  into  a number  of  small  roughly 
spherical  puffs.  Eor  example,  a reasonable  approximation  to  the  continuous  emission  case  described 
earlier  would  be  to  assume  a regular  release  of  puffs  such  that  as  soon  as  one  puff  had  drifted  a 
distance  2R  from  the  source  point  another  puff  was  produced.  The  line  source  is  another  interesting 
geometry.  It  is  produced  when  a vehicle-mounted  spray  tank  is  used  as  the  release  mechanism.  The 
material  is  released  in  a tight  cloud  that  stretches  over  the  distance  traveled  by  the  platform.  In  an 
aircraft  spray  situation,  the  material  may  be  emitted  into  a source  several  kilometers  long  (e.g.,  10 
second  emission  at  300  m/s  is  3 km)  before  any  appreciable  drift  or  diffusion  can  occur.  The 
behavior  of  a line  source  can  be  estimated  by  assuming  that  it  is  produced  instantaneously  in  a 
cylindrical  geometry  of  a given  radius,  and  then  applying  Gaussian  diffusion  in  all  three  dimensions. 
The  mathematics  of  this  will  not  be  developed  here. 
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The  Pasquill  Stability  class  also  correlates  with  other  meteorological  parameters  related  to 
the  effectiveness  of  employing  obscurants  (Table  5-8).  For  example,  the  wind  speed  profile  with 
height  is  often  approximated  by 


u{z)  _ 

f-1 

u(zr) 

(5.20) 


Table  5-8.  Relationship  of  Pasquill  stability  classes  to  other  stability  parameters.  [6] 


PASQUILL  STABILITY 

A 

B 

C 

D 

E 

F 

(1) 

(2) 

(3) 

(4) 

(5) 

(6) 

MEAN  WIND  SPEED  (tn/s) 

3 

4 

5 

8 

5 

3 

MIXING  HEIGHT  (m) 

1000- 

500- 

250- 

100- 

50- 

30- 

3000 

ROUGHNESS 

FACTOR  PARAMETER 

2000 

1000 

800 

200 

50 

Zfl  (m)  = 0.01  Up 

0.201 

0.259 

0.279 

0.463 

0.230 

0.094 

P 

0.062 

0.083 

0.100 

0.145 

0.319 

0.536 

Ri 

-3.58 

-0.87 

-0.34 

0 

0.33 

2.66 

L 

-7.8 

-16.8 

-51.5 

00 

65.3 

14.0 

-157 

-94.0 

-33.0 

0 

21.5 

40.9 

S 

-1.24 

-0.96 

-0.85 

0 

0.84 

0.69 

0.05  Up 

0.275 

0.346 

0.413 

0.604 

0.304 

0.155 

P 

0.086 

0.119 

0.144 

0.189 

0.384 

0.588 

Ri 

-3.26 

-0.67 

-0.21 

0 

0.09 

0.34 

L 

-9.4 

-23.6 

-94.4 

00 

113.7 

22.6 

-178 

-89.4 

-26.7 

0 

18.3 

41.8 

S 

-2.17 

-1.52 

-1.09 

0 

2.72 

1.06 

0.10  Up 

0.326 

0.404 

0.475 

0.695 

0.354 

0.128 

P 

0.103 

0.143 

0.175 

0.217 

0.363 

0.617 

Ri 

-3.12 

-0.57 

-0.16 

0 

0.14 

1.86 

L 

-10.4 

-29.0 

-137 

00 

163.9 

28.9 

-191 

-84.9 

-21.1 

0 

13.2 

27.0 

S 

-1.98 

-1.89 

-1.23 

0 

4.61 

6.37 

0.50  Up 

0.547 

0.649 

0.714 

1.068 

0.554 

0.176 

P 

0.189 

0.262 

0.311 

0.334 

0.447 

0.419 

Ri 

-2.66 

-0.62 

-0.05 

0 

0.07 

1.23 

L 

-14.1 

-70.8 

-568 

00 

332.0 

65.0 

-236 

-55.9 

-7.7 

0 

10.2 

16.5 

s 

-8.55 

-3.57 

-1.21 

0 

3.50 

1.94 

where  Zr  is  a reference  height  (10  m)  and  p is  an  empirical  power  law  exponent.  This  equation  is 
strictly  only  valid  for  neutral  conditions.  A more  useful  profile  is  given  by 
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(5.21) 
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where  Up  is  the  friction  velocity,  k is  the  von  Karman  constant  (=  0.40),  Zo  is  a surface  roughness 
parameter,  and  L the  Monin-Obukhov  length.  The  function  stakes  the  form 
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The  surface  roughness  parameter  Zq  can  be  related  to  the  mean  height  Zp  of  surrounding  terrain 
elements,  including  trees,  grass,  hills,  buildings,  etc.)  by 


In  Zq  - 1.1 9 In  Zg  - 2.85  units  of  meters 


(5.24) 


Table  5-8  also  gives  the  mixing  height.  This  is  the  height  at  which  atmospheric  conditions  are  not 
influenced  by  surface  conditions.  It  can  vary  from  tens  of  meters  under  stable  conditions  to 
kilometers  under  unstable  conditions.  The  Richardson  number  the  Kazanski-Monin  stability 
parameter  ju,  and  the  static  stability  S are  other  stability  related  parameters  that  are  included  in  the 
Table  in  the  spirit  of  a Rosetta  Stone. 
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Transmission  Through  Stratified  Media 

Obscurants,  clouds,  and  fogs  are  typically  stratified  or  localized.  In  these  instances,  use  of 
the  simple  expression 

T = (5.25) 

is  not  appropriate.  If  used  with  the  obscurant  attenuation  coefficient  for  a,  then  Eq.  (5.25)  will 
grossly  overestimate  the  attenuation.  If  the  obscurants,  clouds,  or  fogs  are  indeed  localized, 
stratified,  or  both,  then  more  realistic  calculations  can  be  obtained  using  a discrete  approach. 

Consider  a horizontally  stratified  absorber  such  as  a low  cloud  or  a smoke  plume  as  shown 
in  Figure  5-6.  The  sine  of  the  elevation  angle  (or  depression  angle)  depending  on  your  point  of  view 
is  given  by 


. ^ H T 

sin6'  = — = — = — ^ (5.26) 

R L,  L, 

where  //  is  the  platform  height,  i?  is  the  total  slant  range,  T is  the  absorber  thickness  (or  the  thickness 
of  the  intermediate  layer  if  all  have  substantial  attenuation),  is  the  slant  path  distance  through  the 
central  absorber,  is  the  height  of  the  base  of  the  central  absorbing  layer  (or  the  thickness  of  the 


Figure  5-6.  Geometry  for  calculating  attenuation  in  a horizontally  stratified  environment. 
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bottom  absorbing  layer),  and  Ly  is  the  slant  path  distance  through  the  bottom  layer.  Obviously 


h=R-Lc-I^  (5.27) 

is  the  slant  path  distance  through  the  uppermost  layer.  The  slant  paths  through  the  lower  layers  can 
be  calculated  from 

Ly -T  ! sinO -TR  ! H (5.28) 

and 

Ly  - Hy  ! sin  6 - H^R  / H (5.29) 

The  actual  attenuation  is  the  sum  of  the  slant  path  contributions  from  each  layer 

rp  _ ~^a{R~Lp-Lu^-apLp-aijLij 


-ajR-{T/H)-(Hc/H)]-acR[T/H)-auR[Hc/H) 
- e y J 


(5.30) 


Now  consider  a vertically  stratified  environment  in  which  there  is  a single  vertical 
intervening  absorber  in  an  otherwise  homogeneous  medium,  as  shown  in  Figure  5-7.  The  sine  of 
the  depression 


Figure  5-7.  Geometry  for  calculating  attenuation  in  a vertically  stratified  environment. 
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angle  is  defined  by 


sin  6>  =////?  (5.31) 

where  H is  the  platform  altitude  and  R is  the  total  slant  path  distance.  The  cosine  of  the  depression 
angle  can  be  readily  determined  to  be 

1/2 

cos6>=r/Lc  !R.  (5.32) 

The  slant  path  distance  through  the  smoke  can  now  be  evaluated  to  be 

Lc  ^ RT ! [R^  - . (5.33) 

The  total  path  attenuation  is  the  sum  of  the  attenuation  due  to  the  background  plus  the  attenuation 
due  to  the  cloud 


j.  _ ^~(^a[R~Lc)-OCc^C 


-UaR-a^Lc 


(5.34) 


If  the  thickness  of  the  cloud  is  small  compared  to  the  slant  path  distance  {T«R),  then  the  final 
approximation  in  Eq.(5.34)  is  a very  good  one. 
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Multipath 


Radars  at  low  altitudes  above  the  ground  often  face  conditions  when  the  radar  beam 
simultaneously  illuminates  both  the  ground  and  the  target.  In  this  case,  the  radiation  can  take  at  least 
four  distinct  paths  between  the  transmitter  to  the  target  and  back  to  the  receiver.  As  shown  in  Figure 
5-8  there  is  the  desired  direct  path  (A  to  C to  A).  There  are  two  paths  that  bounce  once  off  the 
ground  (A  to  B to  C to  A and  A to  C to  B to  A)  called  diplane  paths.  There  is  also  one  path  that 
bounces  twice  off  the  ground  (A  to  B to  C to  B to  A),  occasionally  called  the  mirror  image  path. 

If  we  assume  that  the  radar  is  pointed  at  the  target,  then  the  relative  gain  in  the  direction  of 
the  ground  bounce  is  AG.  The  effective  reflectance  of  the  ground  is  denoted  by  p.  L denotes  the 
distance  between  the  radar  and  the  target  along  the  direct  path.  S denotes  the  distance  between  the 
radar  and  the  target  along  the  bounce  path.  The  relative  signal  power  received  at  any  point  is  the 
square  of  the  sum  of  the  four  paths  as  shown  below. [10] 

- = AG^ 


(5.35) 


«"'‘^(i+pag/<*-''> 


Multipath  of  this  type  will  occur  when  the  angle  between  the  target  and  the  ground  bounce  point  is 
less  than  the  beamwidth 


a + /3-idin  ^ [Hj  - //^ ) / /?]  -i-  tan  ^ [Hj  -i-  //^ ) / /?]  < 6^^  (5.36) 


Figure  5-8.  Geometry  for  multipath  lobing  calculations. 
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where  a is  the  elevation  angle  of  the  target  and  y^is  the  depression  angle  of  the  ground  bounce  point, 
R is  the  ground  range  between  target  and  radar,  is  the  height  of  the  radar  above  the  ground,  and 

Hj  is  the  height  of  the  target  above  the  ground.  Nominally,  this  “multipath  condition”  is  satisfied 
when 


H^<XR!2D 


(5.37) 


Equation  (5.35)  can  be  simplified  by  making  the  following  simplifying  approximations 
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Using  these  approximations,  the  path  difference  between  direct  and  bounce  paths  is  given  by 

S - L « 2HjH^  / R . (5.40) 

It  is  also  instructive  to  make  the  additional  simplifying  assumptions:  AG =1  (equivalent  to  assuming 
that  the  target  to  ground  bounce  angle  is  small  compared  to  the  beamwidth)  and  p=-\  (equivalent 
to  assuming  that  the  ground  is  a perfect  conducting  surface  - this  results  in  a ti  phase  shift  which 
gives  the  minus  sign  to  the  reflectance).  Using  these  approximations,  Eq.  (5.35)  becomes 

— «[2sin[k{S-L)/2]]^^16sm^[kH^HJR].  (5.41) 

This  result  implies  that  P/Pg  varies  from  0 to  16  times  the  free  space  value  (Pg)  as  Hj  increases  or 
R increases.  The  nulls  in  the  received  power  occur  at  integer  multiples  (m)  of  n.  That  is,  the  nulls 
occur  when 


niTT  = kHj  Hj^  / R 

(5.42) 

or  when 

Hj  — mXR  / 2Hj^ 

(5.43) 

or 

R — 2HjHj^  ! mX 

(5.44) 

Multipath  interference  is  multiplicative  to  the  normal  range  dependence  of  the  radar 
equation.  If  we  define  Rdi;j  to  be  the  maximum  detection  range  of  the  radar  (against  the  target  of 
interest)  in  the  absence  of  multipath,  then  the  received  power  relative  to  the  minimum  power 
required  for  detection  can  be  written  as 
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If  P/Pdet  is  greater  than  1 , then  the  target  will  be  detectable;  if  it  less  than  1 , it  will  not  be  detectable. 

In  Figures  5-9  and  5-10  we  have  plotted  P/Poet  versus  range  for  two  generic  radars  with 
realistic  parameters.  Figure  5-9  shows  the  effects  of  multipath  on  a search  radar.  The  target  is 
assumed  to  be  a cruise  missile  at  5 m altitude.  The  primary  effect  is  a potential  reduction  of  the 
maximum  detection  range  from  20  km  to  less  than  14  km.  Because  the  reflecting  surface  is  assumed 
to  be  conducting,  the  n phase  per  ground  reflection  leads  to  destructive  interference  at  zero  path 
difference  (m  = 0).  This  causes  performance  at  long  ranges  to  be  worse  than  predicted  by  the  1/R^ 
dependence  of  the  radar  equation.  Such  a reduction  in  maximum  detection  range  could  have  a 
catastrophic  effect  on  the  system  timeline.  If  the  missile  velocity  is  300  m/s,  then  the  available 
reaction  time  is  reduced  from  a maximum  of  67  seconds  to  less  than  47  seconds.  The  other  major 
nulls  occurs  at  ranges  too  short  to  be  of  significance.  Note,  that  if  different  values  of  target  height, 
radar  height,  wavelength,  or  nominal  detection  range  had  been  chosen,  then  the  m=0  interference 
might  have  occurred  at  ranges  too  long  to  be  of  significance. 


Figure  5-9.  Effects  of  multipath  interference  on  the  performance  of  a search  radar. 
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Figure  5-10.  Effects  of  multipath  interference  on  the  performance  of  a tracking  radar. 


Figure  5-10  shows  the  effects  of  multipath  on  a generic  tracking  radar  against  the  same  cruise 
missile  target.  In  this  case,  there  is  an  enhancement  of  detection  at  very  long  ranges  ( 13-20  km  even 
though  the  nominal  detection  range  is  only  10  km).  Unfortunately,  this  is  not  of  much  benefit  to  a 
radar  that  does  not  need  greater  than  10  km  detection  range.  There  is  a broad  null  at  the  nominal 
detection  range,  which  would  cause  complete  loss  of  track  even  if  it  had  been  established  at  the 
longer  ranges.  At  roughly  8.7  km,  the  missile  becomes  detectable  again.  There  is  another  null  at 
5 km  which  is  long  enough  to  potentially  cause  loss  of  track  again.  As  the  missile  comes  closer  and 
closer,  the  nulls  become  more  frequent  but  also  shorter  in  duration  (a  consequence  of  the  1/R* 
dependence  on  the  received  signal).  Note  that  every  null  will  go  to  zero  relative  power  - the 
coarseness  of  the  data  used  in  plotting  the  figures  causes  some  nulls  to  appear  to  stop  at  non-zero 
values.  These  nulls  will  clearly  have  some  effect  on  tracking  accuracy  but  the  exact  effects  would 
have  to  be  determined  through  simulation  of  the  tracking  loop.  It  is  apparent  that  multipath  can 
cause  performance  limitations  in  tracking  as  well  as  search  radars. 
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Terrain  Masking 


Terrain  masking  refers  to  the  difficulty  in  detecting  objects  in  the  open  due  to  terrain,  cultural 
(buildings,  bridges,  levees,  etc.),  or  vegetation  features  interposing  in  the  line  of  sight  (see  Fig.  5- 
11).  It  has  significant  impact  on  mission  accomplishment  in  air-to-surface,  surface-to-air,  and 
surface-to-surface  missions.  Terrain  masking  will  limit  the  ranges  at  which  targets  can  be 
reasonably  detected  and  attacked.  It  will  force  airborne  platforms  to  fly  in  regimes  that  are  less  than 
optimal  from  survivability  aspects.  It  will  shorten  the  timelines  associated  with  target  acquisition, 
tracking  and  weapon  delivery  for  both  attacking  and  defending  systems.  Thus  terrain  masking 
should  be  a serious  concern  to  system  engineers. 

As  is  evident  from  Figure  5-11,  altitude  is  a prime  factor  in  terrain  masking.  At  low 
altitudes,  a clear  line  of  sight  exists  to  only  a fraction  of  the  terrain  surface  and  thus  to  only  a 
fraction  of  the  important  objects  lying  within  a certain  distance  of  the  platform.  If  the  platform  is 
higher,  then  more  (but  not  necessarily  all)  of  the  objects  of  interest  become  visible. 

A number  of  earlier  studies  have  looked  at  various  geographic  regions  and  performed 
analyses  to  determine  the  probability  of  obtaining  a clear  line  of  sight  to  points  on  the  ground  as 
functions  of  platform  elevation  and  ground  range  (distance  along  the  surface  from  the  observation 
point  to  the  target  point).  In  some  of  these  studies  it  was  found  to  be  convenient  to  fit  the  statistical 
probability  of  obtaining  a clear  line  of  sight  to  a point  on  the  ground  to  a function  of  the  form  [11] 

PcLos  ~ ^ (5.46) 

where  R is  the  ground  distance  (in  kilometers),  H is  the  platform  elevation  (in  kilometers),  and  a, 
b,  and  c are  coefficients  resulting  from  the  curve  fitting.  An  example  from  such  a curve  fit  used  the 


Figure  5-11.  Effect  of  platform  elevation  on  line  of  sight. 
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values,  a = -0.0003,  b = 1.658,  c = -1.315  to  represent  “gently  rolling  terrain”. [11]  These  values 
were  used  to  generate  the  curves  in  Figure  5-12.  Examination  of  the  curves  in  this  Figure  indicates 
that  the  probability  of  having  a line  of  sight  out  to  5 km  is  extremely  limited  even  in  benign  terrain, 
unless  platform  altitudes  in  excess  of  30-60  m are  obtained.  If  high  line  of  sight  probabilities  are 
required,  then  very  large  altitudes  (1000  m or  so)  will  be  required. 

Although  line  of  sight  studies  have  been  performed  several  times  in  the  past,  their  limited 
availability  makes  them  difficult  to  use.  They  are  also  not  guaranteed  to  have  data  relevant  to  every 
current  scenario.  Given  the  almost  universal  availability  of  digital  terrain  elevation  data  (DTED) 
maps  and  powerful  laptop  computers,  it  is  almost  as  easy  to  generate  one’s  own  line  of  sight 
probability  data  as  it  is  to  use  the  previous  work.  For  example.  Figure  5-13  shows  a digitized 
topographic  map  of  a portion  of  the  National  Training  Center  at  Fort  Irwin,  CA.[12]  Sometimes, 
examination  of  the  terrain  contours  is  enough  to  provide  usable  information.  At  the  “randomly” 
selected  point  shown  on  the  map  (nominally  on  the  2800  ft  contour  line),  simple  examination  of  the 
directions  with  terrain  contours  higher  than  2800  indicates  that  only  within  the  roughly  120°  arc 
indicated  will  lines  of  sight  greater  than  3000  meters  be  possible.  This  hints  at  an  algorithm  that  one 
might  use  to  obtain  quantitative  line  of  sight  probability  data  for  any  geographical  locale  (for  which 
data  are  available). 


Figure  5-12.  Clear  line  of  sight  probability  for  gently  rolling  terrain 
as  a function  of  ground  distance  and  platform  altitude. 
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Figure  5-13.  Topographic  map  of  a small  portion  of  Fort  Irwin  CA.[12] 


The  algorithm  proceeds  as  follows.  First,  randomly  select  a point  within  the  terrain  of 
interest  and  select  an  observation  altitude.  Second,  for  each  of  the  major  compass  directions,  extract 
the  digital  terrain  values  extending  away  from  the  selected  point.  Then,  using  the  geometry 
indicated  in  Figure  5-14,  calculate  the  tangent  to  the  closest  ground  point  using  the  expression 

idinO  - L!  {^Hq- Hq)  (5.) 

where  L is  the  ground  distance,  Hq  is  the  observation  platform  altitude  above  sea  level,  and  Hq  is 
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Figure  5-14.  Geometry  used  in  line  of  sight  probability  analysis. 


the  ground  elevation  (above  sea  level).  6>is  then  the  angle  relative  to  nadir  (the  point  directly  below 
the  observer)  of  the  line  of  sight  to  the  ground  point.  The  first  value  of  the  tangent  automatically 
becomes  the  running  maximum  value.  The  process  is  repeated  for  the  next  point.  If  the  tangent  of 
the  next  point  exceeds  the  maximum  prior  tangent  value,  then  a line  of  sight  exists  to  that  point.  The 
current  point  then  becomes  the  new  prior  maximum.  This  process  is  repeated  for  each  succeeding 
ground  point  until  a point  is  reached  at  which  the  tangent  is  less  than  the  prior  maximum  (such  as 
point  6 in  Figure  5-14).  If  the  tangent  is  smaller  than  the  prior  maximum,  a clear  line  of  sight  does 
not  exist  to  that  ground  point  (some  earlier  ground  point  (5  in  the  figure  above)  masks  it).  Line  of 
sight  remains  lost  until  the  calculated  tangent  exceed  the  prior  maximum  again  (establishing  a new 
prior  maximum).  The  process  is  repeated  until  all  ground  points  out  to  a desired  maximum  range 
have  been  processed.  The  range  values  for  this  direction  are  then  tagged  (1  or  0)  depending  on 
whether  a line  of  sight  exists.  The  shaded  regions  in  Figure  5-14  are  those  for  which  a clear  line  of 
sight  was  not  obtained.  The  process  is  repeated  for  each  compass  direction  and  for  a large  number 
of  other  points  randomly  scattered  throughout  the  geography  of  interest.  After  several  hundred  runs 
have  been  processed,  the  tagged  values  for  each  range  are  added  and  divided  by  the  total  number 
of  runs.  The  result  is  the  probability  of  clear  line  of  sight  versus  range.  The  entire  process  is 
repeated  for  a variety  of  observation  altitudes  to  obtain  the  height  dependence.  If  desired,  the  line 
of  sight  probability  values  could  be  fit  to  an  equation  of  the  form  of  Eq.  (5.).  There  are  other 
algorithms  that  one  can  find  in  operations  analysis  texts,  if  the  one  described  above  does  not  seem 
attractive.  [13] 
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Underwater  Propagation 

In  Chapter  2 we  calculated  the  attenuation  coefficient  in  the  visible  and  infrared  for  pure 
water  (Figure  2-4)  from  its  complex  refractive  index.  At  that  time  we  noted  that  the  only  region  that 
has  acceptable  transmission  is  the  region  around  500  nm  (the  blue-green  portion  of  the  spectrum. 
The  attenuation  of  sea  water  is  not  greatly  different  from  pure  water  but  does  have  some  minor 
differences.  These  are  mainly  due  to  the  presence  of  particulates  in  sea  water.  Silt,  plankton,  and/or 
microbubbles  (caused  by  wave  breaking)  can  be  present  to  appreciable  extent.  The  particulates  both 
absorb  and  scatter  electromagnetic  radiation  with  scattering  being  a significant  contributor.  The 
attenuation  coefficient  for  a beam  of  radiation  will  depend  on  the  degree  of  “turbidity”  (or 
“muddiness”),  that  is,  the  amount  of  suspended  particulates.  Figure  5-15  compares  the  attenuation 

Figure  5-15.  Attenuation  coefficient  of  sea  water  at  different  locations. [14], [15] 
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coefficient  of  sea  water  at  various  locations  to  that  of  pure  water.  [14], [15]  In  general,  mid-oceanic 
waters  are  relatively  free  of  silt  particulates  and  have  reduced  plankton  levels.  In  the  littoral  waters 
both  the  plankton  eontent  (due  to  wider  availability  of  nutrients)  and  the  silt  eontent  (due  to  inflow 
of  silt-laden  river  waters)  are  expeeted  to  inerease.  Attenuation  is  highest  in  estuaries  and  in  the 
outflow  of  silt-bearing  rivers,  such  as  the  Mississippi  River. 

Another  aspect  of  underwater  propagation  is  shown  in  Figure  5- 16.  [16]  This  figure  shows 
the  effective  attenuation  of  eleetromagnetie  radiation  from  gamma  rays  to  radio  frequencies.  The 
window  at  blue-green  wavelengths  is  clearly  obvious.  However,  at  frequencies  below  1 kHz  in  the 
ELF  (extremely  low  frequency)  region,  the  attenuation  again  drops  below  1 dB/m.  Radio  waves  at 
ELF  frequeneies  can  penetrate  deeply  into  sea  water.  A reasonable  heuristie  is  that  a radio 
communieation  system  can  operate  to  that  depth  at  whieh  an  attenuation  of  100  dB  occurs.  Note  that 
even  if  this  heuristic  were  increased  to  200  dB,  the  difference  in  depth  would  be  only  a factor  of  two. 
Using  this  heuristic,  we  estimate  that  VLF  frequencies  (around  10  kHz)  can  penetrate  to  depths  of 
the  order  of  30  meters,  HF  frequencies  can  only  propagate  to  a depth  of  1 meter  or  so,  and  higher 
frequencies  will  propagate  even  shorter  distances.  On  the  other  hand  ELF  frequeneies  could 
penetrate  to  at  least  100  meters  depth  or  deeper  if  frequencies  of  10-100  Hz  were  used.  The  only 
real  problem  with  communication  at  ELF  frequencies  is  the  extremely  small  bandwidth  that  can  be 
employed.  At  a carrier  frequency  of  100  Hz,  it  may  take  a minute  or  more  to  transmit  a 100  byte 
message. 


Figure  5-16.  Attenuation  of  electromagnetic  radiation  in  sea  water.  [16] 
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Surprisingly,  the  attenuation  also  falls  below  1 dB/m  in  the  gamma  ray  region  of  the 
spectrum.  As  we  see  in  the  Chapter  7,  the  attenuation  coefficient  of  water  falls  below  0.01  cm  ' at 
photon  energies  above  100  MHz.  However,  only  large  particle  accelerators  are  currently  capable 
of  producing  such  gamma  rays,  so  it  may  be  some  time  before  technology  permits  the  exploitation 
of  this  gamma  ray  window. 
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Problems 


5-1.  A manufacturer  claims  that  his  new  sensor  achieves  a contrast  of  0.5  for  a standard  target 
against  the  background.  The  background  intensity  was  measured  to  be  2 units.  What  can 
be  said  about  the  intensity  of  the  signal  from  the  standard  target? 

5-2.  A second  manufacturer  claims  that  his  new  sensor  is  indistinguishable  from  that  of  the  first 
manufacturer.  However,  this  second  manufacturer  measures  a contrast  of  0.25  under  exactly 
the  same  conditions.  Can  he  be  correct  in  his  claims?  If  both  manufacturers  are  completely 
correct  in  their  claims,  what  must  the  target  intensity  be? 

5-3.  A smoke  has  a visible  mass  extinction  coefficient  of  1.0  mVg.  The  same  smoke  has  an 
infrared  mass  extinction  coefficient  of  0.25  m^/g.  If  this  smoke  is  used  in  a smoke  screen 
with  a CL  value  twice  the  minimum  needed  for  an  effective  visible  smoke  screen,  what  is 
the  two-  way  extinction  (out  and  back  through  the  screen)  that  this  screen  provides  in  the 
infrared. 

5-4.  The  sun  is  midway  between  the  horizon  and  the  zenith.  Cloud  cover  is  4/10.  Wind  speed 
is  9 knots.  What  is  the  Pasquill  stability  class?  Is  this  condition  favorable  for  employing 
smoke? 

5-5  There  is  no  rain  but  the  relative  humidity  is  approaching  90%.  Smoke  from  a burning 
vehicle  rises  roughly  50  meters  before  it  abruptly  stops  rising  and  slowly  drifts  laterally.  Are 
the  conditions  good  for  employing  smoke? 

5-6.  A new  countermeasure  smoke  consists  of  gold-plated  microballoons  (hollow  spheres)  with 
radii  randomly  distributed  between  0.5mm  and  1 .5mm.  If  the  scattering  coefficient  of  a 
cloud  of  this  smoke  is  100  km  ' at  300  GHz,  what  scattering  coefficient  do  you  estimate  the 
smoke  to  have 

- at  1 pm? 

- at  10  pm? 

- at  30  GHz? 

- at  3 GHz? 

5-7.  Describe  the  characteristics  of  Mie  scattering  and  quantitatively  discuss  the  impact  of 
scattering  of  scattering  on  attenuation  due  to  fog,  rain,  dust/smoke,  and  chaff. 

5-8.  A laser  radar  has  a beamwidth  of  100  prad  and  is  located  10  meters  above  a flat  plain.  It  is 
attempting  to  detect  and  track  incoming  cruise  missiles  also  at  10  meters  altitude  and 
nominal  10  km  range.  Will  multipath  fading  significantly  affect  the  performance  of  this 
system?  Why  or  why  not? 
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5-9.  Over  a spread  of  ranges  of  interest  a particular  radar  has  a CNR  which  falls  13-25  dB  below 
that  needed  to  reliably  detect  a class  of  inbound  threats.  Might  multipath  have  some 
significant  effect  on  increasing  the  detectability  of  those  threats?  Would  your  answer  change 
if  the  CNR  shortfall  was  only  3-15  dB? 

5-10.  Plot  the  probability  of  clear  line  of  sight  to  the  ground  as  a function  of  platform  altitude  for 
a ground  range  of  10  km.  Use  the  values  a = -0.0003,  b = 1.658,  and  c = -1.315  to  define  the 
terrain. 

5-11.  Identify  the  two  wavelength  regions  of  the  electromagnetic  spectrum  which  propagate  well 
in  sea  water. 

5-12.  A blue-green  laser  communication  system  can  tolerate  a total  extinction  of  100  dB  in  the 
direct  beam.  For  the  clearest  oceanic  water,  how  deep  can  this  system  transmit  information? 
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CHAPTER  6 


PROPAGATION  OF  ACOUSTIC  RADIATION 


What  is  Acoustic  Radiation? 

Acoustic  radiation  is  composed  of  stress  waves  in  a fluid  or  a solid.  In  a fluid  compression 
and  rarefaction  are  the  only  mechanieal  stresses  allowed.  Aeoustie  radiation  in  a fluid  will  be 
composed  of  pressure  waves,  as  shown  in  Figure  6- la.  However,  in  solid  materials,  shear  stresses 
are  also  possible.  Thus,  acoustic  radiation  in  a solid  can  have  shear  wave  components  as  well  as 
pressure  wave  eomponents,  as  shown  in  Figure  6- lb.  Seismic  waves,  which  are  really  nothing  but 
earth  motion-produeed  aeoustie  waves  propagating  through  the  ground  have  readily  apparent  shear 
and  pressure  components,  that  manifest  themselves  as  the  “s”  and  “p”  wave  components,  respec- 
tively, of  the  seismic  waves  seen  in  seismometer  recordings  of  earthquakes.  In  the  following  we  will 
consider  only  pressure  waves,  as  most  applications  have  air  or  water  as  the  propagation  medium. 


Figure  6-1.  Types  of  stress  waves  comprising  acoustic  radiation. 


a)  Fluids 


COMPRESSION 


b)  Solids 


COMPRESSION 


SHEAR 
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The  existence  and  properties  of  acoustic  pressure  waves  comes  from  consideration  of  three 
fundamental  equations  of  materials:  the  equation  of  motion,  the  continuity  equation,  and  the 
equation  of  state.  The  equation  of  motion  is  essentially  a statement  of  Newton’s  second  law  (F  = 
ma)  of  motion.  The  continuity  equation  is  an  expression  of  conservation  of  mass  in  the  system.  The 
equation  of  state  is  the  relation  between  density  and  pressure  in  the  material. 

The  lossless  (no  viscous  damping)  equation  of  motion  of  a fluid  is  [1] 

^ dv 

~^P  = Po^  (6-1) 

ot 


where  p is  the  total  pressure  including  ambient  and  wave  pressures,  pg  is  the  ambient  density,  and 
V is  the  velocity  of  the  fluid.  The  pressure  gradient  (-Vp)  is  the  net  force  acting  on  a volume  of  fluid 
of  unit  area  and  unit  thickness.  This  force  causes  the  mass  {pg  times  the  unit  volume)  to  accelerate 
with  acceleration  {dv/di).  The  continuity  equation  for  the  fluid  is  given  by 

^ + V.(av)  = 0.  (6.2) 

Ot 

The  continuity  equation  states  that  any  change  in  density  must  be  accompanied  by  a flow  of  material 
into  or  out  of  the  volume  under  consideration. 

The  equation  of  state  describes  the  pressure  as  a function  of  density 

P = p(p)  • (6.3) 

Using  the  chain  rule  for  derivatives,  it  is  easy  to  show  that  in  the  adiabatic  limit  (no  heat  transfer  into 
or  out  of  the  material),  we  obtain 


dp 

dt 


dp  dp 

dp  dt 

^ Po 


Defining  a quantity  c by  the  relation 


c 


2 


(6.4) 


(6.5) 


we  may  substitute  Eq.(6.4)  into  Eq.(6.2)  to  yield 
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(6.6) 


1 dp 


+ PoV-v  = 0. 


Taking  the  time  derivative  of  both  sides  of  Eq.(6.6)  yields 


1 d^P  V7  _ A 


c"  dt 


dt 


Substitution  of  Eq.(6.1)  into  this  result  yields 


or 


de  ^ 


vV-4^  = o. 


c"  dt^ 


(6.7) 


(6.8) 


(6.9) 


Equation  (6.9)  has  the  classical  form  of  a wave  equation  with  c being  interpreted  as  the  wave 
propagation  velocity. 


Equation  (6.9)  differs  from  the  electromagnetic  wave  equation  only  in  that  the  pressure  is 
a scalar  quantity  while  the  electric  and  magnetic  fields  are  vector  quantities.  As  a consequence, 
pressure  waves  do  not  exhibit  polarization.  Otherwise,  the  same  kinds  of  waves  that  satisfy  the 
electromagnetic  wave  equation  will  satisfy  the  acoustic  wave  equation.  Plane  wave  solutions  of  the 
form 


p{r,t)  ^ p^e 


icot 


(6.10) 


where  p is  the  (acoustic)  pressure  deviation  from  the  ambient  hydrostatic  pressure  (the  total 
pressure  is  the  sum  of  the  acoustic  pressure  and  the  hydrostatic  pressure)  and 


CO  = 


ck^  = 2ttc  ! X 


(6.11) 


are  permitted.  These  solutions  oscillate  sinusoidally  in  both  space  and  time  with  temporal  frequency 
cdl%  and  free-space  wavelength  X.  Spherical  waves  of  the  form 

^icot  + ikr 

p(r,t)  = Po (6-12) 

r 

are  also  viable  solutions  to  the  acoustic  wave  equation. 
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Associated  with  any  acoustic  pressure  level  p is  an  acoustic  intensity  / defined  by  the 
equation 

1 = ^^.  (6.13) 

2p,c 

Both  acoustic  pressure  and  acoustic  intensity  are  characterized  in  units  of  decibels.  That  is, 

/ (in  dB)  = 101ogiQ|^/(in  W/m^)/  (6.14a) 

or 

Po  (in  dB)  = 201ogio[;?o(  in  //Pa)  (6.14b) 

where  is  a reference  intensity  that  is  produced  by  a reference  sound  pressure  pg^p- 
atmospheric  acoustics,  the  reference  pressure  is  pg^g  = 20  pPa;  in  underwater  acoustics,  the 
reference  pressure  is  pg^p  = 1 pPa.  Because  I is  proportional  to  the  square  of  pg,  the  definition  of  the 
decibel  by  Equations  (6.14)  means  that  the  value  of  acoustic  intensity  in  decibels  is  equal  to  the 
value  of  the  acoustic  pressure  in  decibels  for  any  value  of  pressure  or  intensity.  That  is, 

/ (in  dB)  = (in  dB) . (6.15) 

This  duality  between  pressure  and  intensity  makes  it  extremely  convenient  to  use  the  logarithmic 
decibel  units  rather  than  the  conventional  units  of  Pascals  and  Watts/square  meter. 
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The  Speed  of  Sound 

For  an  ideal  gas  we  can  evaluate  the  expression  for  the  propagation  velocity  explicitly.  The 
equation  of  state  for  an  ideal  gas  may  be  written  as 


p = 


pRT 

M 


(6.16) 


in  its  isothermal  form,  where  R is  the  ideal  gas  constant  (=  8.31441  J/mole-K),  T is  the  gas 
temperature,  and  M is  the  molar  weight  of  the  gas  (=  0.028964  kg  for  air).  In  its  adiabatic  form  the 
equation  of  state  is 


where 


Po 


f Y 

P 


KPo) 

r = CpiCv 


(6.17) 

(6.18) 


is  the  ratio  of  specific  heats  (=  1.4  for  air  and  =1.667  for  monatomic  gases).  Taking  the  density 
derivative  of  Eq.(6.17)  yields  an  expression  into  which  subsequent  substitution  of  Eq.(6.16)  yields 


dp 

dp 


y-\ 


Po 


- PoY~ — 


= 


Po 


Po 


M 


(6.19) 


For  air,  we  can  easily  evaluate  Eq.(6.19)  to  yield 

c (in  m/ s)  = 20.05  (6.20) 

where  T is  in  K.  For  T = 300  K,  we  calculate  c = 347.3  m/s. 

In  Eq.  (6.20)  we  have  ignored  any  temperature  dependence  or  gas  mixture  dependence  of 
the  ratio  of  specific  heats.  To  first  order,  y for  air  may  be  considered  to  be  a constant  and  equal  to 
1 .4.  To  a higher  order  of  fidelity,  yhas  a weak  dependence  on  temperature,  carbon  dioxide  fraction, 
and  relative  humidity.  This  results  because  any  absorption  caused  by  molecular  species  must  be 
included  in  the  specific  heats  (and  thus  in  the  ratio  of  specific  heats).  If  we  denote  the  result  from 
Eq.(6.20)  by  the  symbol  Cg,  then  taking  the  molecular  absorptions  into  account  the  speed  of  sound 
may  be  written  as  [2] 
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where  the  terms  X^,  0„  and  a>^  are  defined  in  the  following  section  on  absorption  of  sound  waves. 
In  most  cases  we  may  neglect  the  more  complicated  behavior  of  the  speed  of  sound  in  air  and  use 
the  simple  result  of  Eq.(6.20). 

The  equation  of  state  for  sea  water  is  considerably  more  complex  than  that  of  an  ideal  gas. 
Direct  evaluation  of  Eq.(6.5)  is  therefore  impractical.  Empirical  measurements,  however,  have 
yielded  the  following  results  for  the  speed  of  sound  in  sea  water  [3] 

c = 1402.06  + 1.340^  - 0.0102575' 

+ 4.9507  - 0.053047^  + 2.374  x 10“^  7^  (6.22) 

+ 0.0163Z  + 1.675  X 10'"z'  - 7.139  x 10"'^  7z^ 

where  c is  the  speed  of  sound  in  m/s,  T is  the  temperature  in  °C,  S is  the  salinity  in  parts  per 
thousand  (which  are  equivalent  to  the  so-called  practical  salinity  units  (psu)),  and  z is  the  depth 
in  m. 


Both  temperature  and  salinity  are  highly  variable  with  respect  to  location  as  well  as  depth 
in  the  oceans  of  the  world.  Figure  6-2  illustrates  typical  seasonal  variations  in  temperature  vs.  depth 
at  three  different  latitudes.  Sea  surface  temperatures  range  from  as  high  as  30-35  °C  at  tropical 
latitudes  to  -1  or  -2  °C  in  Arctic  and  Antarctic  waters.  At  very  great  depths  (below  1000  m)  the 
temperature  tends  to  be  more  uniform  and  decreases  very  slowly  from  roughly  3-4  °C  to  as  low  as 
0.5-2. 5 °C  depending  on  location.  The  temperatures  tend  to  be  more  or  less  constant  in  a mixed 
layer  from  the  surface  to  depths  of  the  order  of  100-200  m and  then  decrease  with  increasing  depth 
until  the  nominal  isothermal  layer  is  reached.  At  depths  below  100-200  m there  is  generally  a 
decrease  in  temperature,  known  as  a thermocline.  The  portion  of  the  thermocline  that  varies 
seasonally  is  called  the  seasonal  thermocline.  At  greater  depths  (typically  below  400-600  m)  the 
seasonal  variations  tend  to  disappear  and  the  temperature  decrease  is  called  the  permanent 
thermocline. 

The  inflow  of  water  from  large  rivers  will  create  plumes  of  usually  warmer  water  that  may 
extend  hundreds  of  miles  into  the  open  ocean.  The  interaction  of  ocean  currents  with  the  oceanic 
topography  and  with  each  other  complicates  the  normal  behavior  of  temperature  versus  depth.  Cold 
water  can  be  forced  up  from  great  depths  to  mix  with  warmer  surface  waters.  Warm  currents  such 
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Figure  6-2.  Temperature  variation  with  depth  at  different  latitudes. 


TROPICAL  MID-LATITUDE  POLAR 


TEMPERATURE  deg  C TEMPERATURE  deg  C TEMPERATURE  deg  C 


as  the  Gulf  Stream  will  form  warm  and  cold  eddies  as  they  move  past  colder  waters.  Thus,  there  are 
many  ways  in  which  it  is  possible  for  layers  of  warmer  water  to  exist  between  layers  of  colder  water. 

The  salinity  of  the  ocean  is  as  variable  as  the  temperature.  The  average  salinity  at  the  surface 
of  the  North  Atlantic  is  36.5  psu,  while  it  is  only  35.0  in  the  Pacific  and  Indian  Oceans,  and  as  low 
as  33  psu  in  the  Arctic  Ocean.  In  the  Eastern  Mediterranean  Sea,  which  has  limited  freshwater 
inflow  and  severe  evaporation,  the  salinity  can  reach  as  high  as  39. 1 psu.  Evaporation  significantly 
increases  the  salinity  at  the  surface  of  the  ocean.  If  the  Straight  of  Gibraltar  were  to  become 
blocked,  it  is  estimated  that  the  Mediterranean  would  evaporate  completely  within  1 000  years.  Long 
before  it  completely  dried  up,  the  salinity  would  exceed  300  psu,  the  salinity  of  the  Dead  Sea  and 
other  salt  lakes.  Precipitation  can  significantly  reduce  salinity.  For  example,  1 cm  of  precipitation 
mixed  with  the  upper  10  m of  seawater  (the  amount  that  might  reasonably  be  mixed  on  time  scales 
of  a few  hours  in  moderate  Sea  States)  will  reduce  the  salinity  by  1 psu.  Freshwater  has  extremely 
low  salinity  (by  definition  less  than  3 psu).  High  flow  rivers  such  as  the  Mississippi  River  and  the 
Amazon  River  can  produce  plumes  of  freshwater  that  extend  for  hundreds  of  kilometers  before  they 
are  thoroughly  mixed.  Surface  and  subsurface  currents  carrying  water  from  high  salinity  regions 
to  low  salinity  regions,  and  vice  versa,  cause  a fairly  complex  variation  of  salinity  with  depth.  For 
example,  Figure  6-3  shows  the  undersea  saline  “waterfall”  pouring  over  the  sill  of  Gibraltar  from 
the  highly  saline  Mediterranean  Sea.  As  shown  in  Figure  6-4,  this  outflow  extends  almost  to  the 
East 
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Figure  6-3.  The  outflow  of  supersaline  Mediterranean  Sea  waters 
through  the  Straights  of  Gibraltar.[4],[5] 


coast  of  Florida  (the  Gulf  Stream  blocks  its  continued  westerly  flow). 

The  various  temperature  gradients,  thermal  inversion  layers,  salinity  gradients,  supersaline 
and  freshwater  plumes  produce  inhomogeneities  in  the  speed  of  sound  that  complicate  the 
propagation  of  acoustic  radiation.  These  speed  of  sound  variations  define  the  nature  of  acoustic 
propagation  in  the  underwater  environment.  The  peculiarities  of  underwater  sound  propagation  will 
be  discussed  in  a subsequent  section. 

Similar  stratified  variations  in  humidity  and  temperature  affect  the  speed  the  sound  in  air. 
These  variations  will  cause  atmospheric  acoustic  effects  almost  identical  to  those  that  are  observed 
in  underwater  acoustic  propagation. 
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Figure  6-4.  Spreading  c 
North  Atlantic  ocean 


diterranean  Sea  salinity  across  the 
:s  at  intermediate  depths. [4], [6] 


Attenuation  of  Acoustic  Radiation 


As  pressure  waves  propagate  through  the  air  or  water  medium,  a number  of  factors  work  to 
attenuate  the  pressure  levels  (or  intensities)  in  those  waves.  We  will  define  the  attenuation 
coefficient  a (in  nepers/m)  or  a (in  dB/m)  by  the  intensity  relation 


I{R)  = I(R,) 


( r\ 

U J 


(6.23) 


\Rj 


which  also  incorporates  the  spherical  wave  spreading  factor  (1/i?^).  This  is  a common  but  not 
universally  accepted  definition.  Using  this  definition,  the  pressure  attenuation  expression  is 


p(R)  = ^ \e  10  . (6.24) 

\R  J \ R J 


The  exponential  and  logarithmic  forms  of  the  attenuation  coefficients  are  related  by 


a - 4.3430a  and  a - 02303a  . 


(6.25) 


Figure  6-5  describes  the  various  factors  that  contribute  to  acoustic  attenuation.  Every  fluid 
has  a viscosity.  Viscosity  attenuates  acoustic  radiation  in  the  following  way.  Pressure  waves  consist 
of  alternating  high  pressure  and  low  pressure  regions.  As  the  wave  propagates,  the  fluid  molecules 
must  move  out  of  the  average  pressure  regions  into  the  high  pressure  regions  and  out  of  the  low 
pressure  regions  into  the  average  pressure  regions.  Once  the  wave  motion  has  established  the  high 
and  low  pressure  regions,  the  molecules  will  move  under  the  pressure  gradients  back  towards  their 
original  positions  to  reestablish  pressure  equilibrium.  Viscosity  resists  molecular  motion.  Thus,  the 
wave  energy  associated  with  the  molecular  motion  is  reduced  (converted  to  heat)  by  viscosity.  This 
reduces  the  pressure  differences  that  can  be  established  - the  fundamental  definition  of  acoustic 
attenuation. 

Thermal  diffusion  acts  to  attenuate  acoustic  radiation  in  the  following  fashion.  When  the 
pressure  wave  produces  high  and  low  pressure  regions,  the  high  pressure  regions  will  become 
warmer  and  the  low  pressure  regions  will  become  colder.  Thermal  diffusion  acts  to  move  energy 
from  warm  regions  to  cold  regions.  As  energy  diffuses  out  of  the  warmer  regions,  they  cool  and  the 
pressure  will  be  reduced.  As  energy  diffuses  into  the  cooler  regions,  they  warm  up  and  the  pressure 
will  be  increased.  Thermal  diffusion  acts  to  reduce  the  pressure  in  the  high  pressure  regions  and 
increase  the  pressure  in  the  low  pressure  regions  - once  again  this  reduction  of  pressure  differences 
is  the  definition  of  acoustic  attenuation. 

Vibrations  of  the  fluid  molecules  can  attenuate  acoustic  radiation  in  the  following  fashion. 
As  the  pressure  in  a region  increases,  it  heats  up.  As  it  heats  up,  the  molecules  will  try  to  establish 
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Figure  6-5.  Mechanisms  of  attenuation  of  acoustic  radiation. 
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molecules  increase  in  concert  with  the  new  Boltzmann  factors  However,  because  the 

vibrational  energies  often  exceed  the  thermal  energy  {kT),  vibrational  excitation  does  not  occur 
instantly.  It  takes  a number  of  collisions  before  the  geometry  and  kinetic  energies  of  the  molecules 
are  just  right  to  cause  a vibrational  excitation  to  occur.  In  the  same  way,  not  every  collision  allows 
stored  vibrational  energy  to  be  released.  Thus,  molecular  vibrations  slowly  absorb  energy  when  the 
high  pressure  regions  heat  up.  The  energy  absorption  tends  to  reduce  the  peak  temperature  and  thus 
the  peak  pressure  obtained.  When  the  pressure  falls  as  the  wave  moves  on,  the  excited  molecules 
do  not  immediately  release  their  stored  energy.  They  are  most  likely  to  release  their  energy  when 
the  temperature  is  lowest.  The  release  of  energy  tends  to  increase  the  temperature  and  pressure  of 
the  low  pressure  regions.  Thus,  the  in-phase  absorption  and  out-of-phase  release  of  energy  by  the 
vibrations  lowers  the  maximum  pressure  in  the  high  pressure  regions  and  increases  the  minimum 
pressure  in  the  low  pressure  regions  - the  fundamental  definition  of  acoustic  attenuation. 

In  a highly  analogous  fashion,  chemical  reactions  can  attenuate  acoustic  radiation.  Consider 
an  ionic  reaction  in  a solution 

AB  A*  + B~  + IS.E  (6.26) 

Under  normal  conditions,  the  chemical  salt  AB  dissolves  into  and  R ions  with  the  release  of  a 
small  amount  of  energy  AE.  At  high  pressures,  the  dissolved  species  are  driven  to  recombine 
(reactions  often  act  to  counteract  changes  in  the  environment  - fewer  species  leads  to  reduced 
pressure).  Formation  of  the  salt  causes  a pressure  reduction  by  reduction  of  the  number  of  fluid 
components  as  well  as  by  absorbing  energy.  In  regions  of  lower  pressure,  the  salt  will  tend  to 
dissolve,  but  this  process  is  not  instantaneous.  It  is  most  likely  to  occur  when  the  pressure  is  lowest. 
The  dissolution  process  releases  energy,  increasing  the  temperature,  and  thus  increasing  the 
pressure.  The  process  also  directly  increases  the  pressure  by  increasing  the  number  of  species. 
Thus,  the  in-phase  recombination  and  out-of-phase  dissolution  of  chemical  salts  lowers  the 
maximum  pressure  in  the  high  pressure  regions  and  increases  the  minimum  pressure  in  the  low 
pressure  regions  - once  again,  the  fundamental  definition  of  acoustic  attenuation. 

Many  entities  in  the  air  and  ocean  environments  can  scatter  acoustic  radiation.  Density 
fluctuations,  particulates  (silt,  dust,  aerosols,  raindrops,  etc.),  and  biological  species  (insects, 
plankton,  fish,  etc.)  all  act  as  scatterers  of  acoustic  radiation.  However,  contrary  to  the  first  four 
processes,  scattering  does  not  act  by  reducing  the  high  pressures  and  increasing  the  low  pressures. 
It  works  by  redirecting  acoustic  energy  away  the  original  directions.  This  direct  reduction  in  energy 
flow  produces  the  attenuation. 

Let  us  consider  first  the  attenuation  of  acoustic  radiation  in  air.  The  ANSI  standard  ANSI 
S1.26-I978  gives  detailed  instructions  for  calculating  the  attenuation  of  acoustic  radiation  in  air.  [7] 
This  publication,  however,  uses  a different  definition  of  the  attenuation  coefficient.  Specifically, 
it  defines  the  coefficient  a’  based  on  pressure  attenuation 

p(x)  = . (6.27) 
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The  ANSI  attenuation  eoefficient  is  related  to  our  eoefficients  a and  a by  the  simple  relations 


a -0.5a'  and  a-2.\12a'.  (6.28) 

Empirieally,  it  has  been  determined  that  only  viscosity,  thermal  diffusion,  and  vibrational  relaxation 
in  oxygen  and  nitrogen  are  major  contributors  to  absorption  in  air.  The  total  attenuation  coefficient 
is  the  sum  of  the  individual  contributions 

cc  — cCyjyuj^  + a + a . (6.29) 

Thermal  diffusion  and  viscosity  are  usually  treated  together.  Their  contribution  is  given  by 

= 1.84x10-" r-  (r/r„)'“  !(p, /pj  (6.30) 

where  v = (t/2:^ is  the  acoustic  frequency,  Tg  = 293.15  K,  and p^g  = 101325  Pa. 

The  contribution  from  vibrational  relaxation  in  nitrogen  is  given  by 

/ \ / \1  a IT  / Ct)fj  ) 

a„,  =(m/c„)(X,pi5)(0,pr)  ^ (6.31) 

where  the  speed  of  sound  is 

c,  = 343.23(T  / (6.32) 

the  partial  fraction  of  nitrogen  is 

= 0.781,  (6.33) 

and  the  characteristic  vibrational  temperature  for  nitrogen  is 

^^^=  3352.2  K.  (6.34) 

The  characteristic  frequency  for  nitrogen  is  given  by 

m^,_=24pPp„){T/T„)  ' 9 + 350/1  e'  ''  . (6.35) 

V J 

The  attenuation  produced  by  molecular  oxygen  is  treated  in  an  analogous  fashion,  with 
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«o,  = 


[coIc)[x,J?>5)[6oJT) 


\ + [col  COo^) 


where  c is  the  same  for  oxygen  as  for  nitrogen,  the  partial  fraction  of  oxygen  is 

= 0.209  , 


(6.37) 


and  the  characteristic  vibrational  temperature  for  oxygen  is 


00  = 2239.1  K. 


(6.38) 


The  characteristic  frequency  for  oxygen  is  given  by 


cOo=27T{pJp,,)\2A  + AA\x\Q^h 


0.05  + A 
0.391  + A 


(6.39) 


where  h is  the  molar  concentration  of  water  vapor  in  percent.  That  is, 


Ph^o  (ill  g / ) Y 1 OO(percent) 


1 8.0 1528(g/ mole)  II  44.61 48(mole/m')  IpaJ 


= 0.1244/?, 


(6.40) 


Pao  1^0 


where  is  p^2o  is  the  absolute  humidity  (in  g/m^)  at  atmospheric  pressure  and  temperature  T. 
Combining  all  of  the  contributions  yields 


a = 


(®/2+^{[i.84xio-‘'(p,/p,,r'(r/r,C] 

+ (r/7’„)-”{0.08030[v“’‘"]/[fflo^ 

+ (r  / r, {o,67  ® )]}} 


(6.41) 


For  typical  normal  values  of  temperature,  pressure,  and  humidity,  the  attenuation  as  a function  of 
frequency  is  shown  in  Figure  6-6.  Figures  6-7  and  6-8  show  the  effects  of  humidity  on  acoustic 
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attenuation  in  the  atmosphere.  These  curves  were  calculated  using  the  equations  above  with  the 
attenuation  coefficient  being  transformed  to  the  standard  version  used  throughout  the  remainder  of 
this  text  using  Eq.  (6.28). 


Figure  6-6.  Attenuation  of  acoustic  radiation  in  air. 
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Figure  6-7.  Effects  of  atmospheric  humidity  on  acoustic  attenuation  in  air  at  0°C. 
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Figure  6-8.  Effects  of  atmospheric  humidity  on  acoustic  attenuation  in  air  at  20°C. 
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Several  things  should  be  apparent  from  an  examination  of  Figure  6-6.  First,  high  frequencies 
have  higher  attenuations  than  low  frequencies.  This  means  that  if  acoustic  waves  are  to  be 
propagated  very  long  distances  through  the  atmosphere,  then  those  waves  must  have  relatively  low 
frequencies.  The  low  frequency  “rumble”  of  thunder  can  be  heard  for  many  miles;  the  high 
frequency  “crack”  can  only  be  heard  at  short  distances.  The  nominal  break  point  in  frequency 
occurs  at  1 kHz.  At  this  frequency,  the  attenuation  coefficient  is  roughly  5 dB/km.  At  higher 
frequencies,  propagation  for  a few  kilometers  will  produce  attenuations  of  tens  of  dB.  At  lower 
frequencies,  propagation  for  a few  kilometers  produces  relatively  small  attenuations.  Thus,  long 
range  detection  can  only  be  done  at  frequencies  below  1 kHz.  If  it  is  desired  to  make  long  range 
detection  difficult,  then  the  system  should  strive  to  produce  only  very  high  frequency  sound  and  any 
low  frequencies  should  be  damped  to  the  maximum  extent  possible. 

The  acoustic  attenuation  coefficient  in  sea  water  has  major  contributions  from  viscosity, 
chemical  relaxation,  and  scattering.  [4]  The  total  attenuation  coefficient  is  given 

^ ^Scattering  ^ Boric  Acid  ^ Magnesium  Sulfate  ^Viscosity  (6.42) 


The  scattering  component  is  essentially  independent  of  acoustic  frequency.  It  has  a value  of 


a 


Scattering 


= 6.9x10 


-7 


m 


-1 


(6.43) 


The  boric  acid  relaxation  term  involves  the  chemical  reaction 


H3BO3  . 


The  relaxation  contribution  to  the  attenuation  has  the  form 


4 -1 

^BoricAcid  ~ / \2  ^ 

X + ycot^) 


where  the  coefficient  A2  is  approximately 


4=3.269x10'"  s-m 


-1 


and  the  relaxation  time  constant  is 


— = 3.8 


,1/2 


1051 


f C A 7-- 

^ IqV  r+273.15j  „-l 

bsj 


(6.44) 


(6.45) 


(6.46) 


(6.47) 
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The  magnesium  sulfate  relaxation  involves  the  chemical  reaction 


MgS04  ^ + S04- 


(6.48) 


The  relaxation  contribution  to  the  attenuation  has  the  form 


' Magnesium  Sulfate 


J^S(s)f(p) 


(6.49) 


where  the  coefficient  S(s),  which  contains  an  explicit  salinity  dependence  S(s)  is  given  by 


45(5)  = 5- 3.73x10“", 


(6.50) 


the  relaxation  time  constant  is  given  by 


1 11 

- = 1.38x10^ 


(6.51) 


and  the  pressure  dependence  is  given  by 


P(v??)  = 1-6.46  X 10“  V. 


(6.52) 


where  is  the  ambient  hydrostatic  pressure  in  Pascals. 

The  viscosity  contribution  to  the  total  acoustic  attenuation  takes  the  form 


8 UCO  X 

W.scos.ty  =T 

3 pc 


(6.53) 


where  p is  the  density  of  the  seawater,  c is  the  speed  of  sound,  p is  the  effective  viscosity  and  is 
given  by 


//  = 2xl0^  kg-m  ^ - s~ 


(6.54) 


5 „-i 


= 9 X 10  5 


(6.55) 


Figure  6-9  shows  the  total  acoustic  attenuation  in  sea  water  and  the  contributions  to  the 
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attenuation  from  all  of  the  major  sources.  As  in  propagation  in  the  atmosphere,  acoustic  propagation 
in  sea  water  favors  low  frequencies.  The  nominal  cutoff  frequency  in  sea  water  is  20  kHz  where  the 
attenuation  coefficient  again  is  of  the  order  of  5 dB/km.  Frequencies  below  20  kHz  can  be  detected 
at  ranges  in  excess  of  a few  kilometers  (if  other  propagation  effects  permit),  while  frequencies  above 
20  kHz  will  be  severely  attenuated  after  a few  kilometers  of  propagation.  The  lowest  frequencies 
(1-100  Hz)  can  propagate  halfway  around  the  world  with  substantial  but  tolerable  attenuations. 


Figure  6-9.  Acoustic  attenuation  of  sea  water. 


213 


Figure  6-10.  Effects  of  temperature  and  depth  on  acoustic  attenuation  in  sea  water. 
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Reflection  and  Refraction  of  Acoustic  Radiation 


As  evidenced  in  Eqs.  (6.20)  and  (6.22),  the  speeds  of  sound  in  air  and  the  ocean  are  both 
functions  of  temperature.  The  speed  of  sound  in  the  ocean  is  also  a function  of  salinity  and  depth. 
Temperature  is  a highly  variable  function  of  altitude  in  the  atmosphere,  and  both  temperature  and 
salinity  vary  considerably  with  depth  in  the  ocean.  Thus,  non-zero  vertical  gradients  in  the  speed 
of  sound  are  expected.  By  analogy  with  the  speed  of  light  in  the  atmosphere,  which  also  exhibits 
strong  vertical  gradients  (Chapter  4),  we  expect  anomalous  refractive  propagation  of  acoustic 
radiation  in  both  the  atmosphere  and  in  the  ocean. 

Before  discussing  anomalous  behavior,  we  should  review  the  effects  of  normal  reflection  and 
refraction  at  an  interface  between  two  acoustic  media.  The  first  medium  is  characterized  by  speed 
of  sound  c I and  density  p,  while  the  second  medium  is  characterized  by  speed  and  density  P2-  For 
convenience,  we  may  define  an  acoustic  “refractive  index”  rj  by 

Tj  ^ REF  ^ ^ (6.56) 

where  Cj^p  is  an  arbitrarily  chosen  velocity.  Thus  we  may  define  a relative  refractive  index  n at  the 
interface  by 


n-Cy!  c^  - rj^l  rjy. 


(6.57) 


Figure  6-11.  Reflection  and  refraction  of  acoustic  radiation  at  an  interface. 


We  need  also  define  a density  mismateh  faetor  m by 


m = p^l  p,. 


(6.58) 


A ray  of  acoustie  energy  incident  on  the  interface  at  an  angle  ^ will  be  bent  exiting  the  interface  at 
an  angle  6^  with  the  angles  related  by  the  refractive  indices 


cosO-  cosO, 

^ = <=> 

Cl  C2 


77i  cos6'i  = 772  cos6'i  . 


(6.59) 


Eq.  (6.59)  is  the  acoustic  equivalent  of  Snell’s  Law.  Note  the  definition  of  angles  used  in  Urick’s 
presentation  (on  which  our  own  is  modeled)  is  just  the  opposite  of  that  used  in  electromagnetism. 
If  the  electromagnetic  definition  (angle  relative  to  the  surface  normal)  were  used,  then  the  cosines 
in  Eq.(6.59)  would  be  replaced  by  sines  and  the  expression  would  become  identical  to  Snell’s  Law 
(Eq.(4.6)). 

The  incident  acoustic  radiation  is  not  just  refracted,  a portion  of  the  radiation  is  reflected. 
The  fraction  of  radiation  that  is  reflected  is  given  by  the  expression 


r 2 2 

m sin  Of  - n - cos  6^ 

i~2  2 

m sin  n - cos  6^ 


(6.60) 


This  expression  exhibits  different  behavior  depending  on  the  relative  values  of  m and  n.  These 
behaviors  are  shown  in  Figure  6-12.  There  are  two  special  angles  that  become  apparent.  The  first 
of  these  is  “Brewster’s  Angle” 


= COS 


—n^ 


1/2 


(6.61) 


the  angle  at  which  the  acoustic  energy  is  completely  transmitted  through  the  interface  with  no 
reflected  component.  The  other  angle  is  the  “critical  angle” 

(9(2  = cos“^  n , (6.62) 

the  angle  at  which  the  acoustic  energy  is  completely  reflected  from  the  interface  with  no  transmitted 
component.  Both  angles  are  indicated  in  the  figure  (for  the  values  of  m and  n used  in  plotting  the 
figure).  In  either  air  or  water,  if  acoustic  radiation  encounters  a layer  with  strongly  different 
refractive  index,  the  ray  will  be  strongly  reflected.  Such  layers  may  be  used  to  hide  from  sonar 
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Figure  6-12.  Reflectance  versus  incidence  angle  for  several  regimes  of 
relative  refractive  index  and  density  mismatch. 


systems. 

Given  the  above  expressions,  let  us  consider  atmospheric  propagation  first.  The  vertical 
gradient  of  refractive  index  is  given  by 


drj  _ drj  dT 
dh  dT  dh 


(6.63) 


Substituting  Eq.(6.20)  into  Eq.(6.56)  and  differentiating  yields 


^(20.05r")~'  n 

dT  dT  20.05 1 2)  2T' 


(6.64) 
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From  Chapter  3,  we  recall  that  the  normal  lapse  rate  of  the  atmosphere  is  approximately  0.0064  K/m. 
Thus,  the  normal  atmospheric  gradient  of  sound  speed  is 


drj 

dh 


V 


" Vo.0064)  = i^^  , 


IT 


(6.65) 


Such  a refractive  gradient  will  cause  some  sort  of  acoustic  refraction.  By  analogy  with  the 
results  in  Chapter  4,  we  have  the  radius  of  curvature  defined  by 


f 


r = -77 


1 


drj ! dh 


cosO , 


(6.66) 


the  absolute  curvature  defined  by 


P = 


1 


drj 
TjcosO  dh 


(6.67) 


and  the  curvature  relative  to  the  earth’s  surface  defined  by 


, 1 1 drj 

p'  ^ — + 

rj  cos  6 dh 


(6.68) 


Upon  substitution  of  Eq.(6.65)  into  Eq.(6.68)  we  find  that  p’  is  positive  and  the  radiation  curves 
away  from  the  surface.  On  the  other  hand  if  a temperature  inversion  exists  at  some  height,  then  the 
refractive  index  gradient  can  be  negative.  If  the  gradient  is  sufficiently  negative  then  the  ray 
curvature  may  be  negative  and  the  acoustic  radiation  would  be  trapped.  The  requirement  for  this 
is 


or 


P 


1 


1 


rj  cos  6 


^ -rj^ 

\2T, 


dh 


1 


1 


dT 


R„  ITcosO  dh 


< 0 


(6.69) 


dT  IT  cosO 

— > 

dh  R^ 


(6.70) 


For  r=  300  K and  = 6.37x10®  m,  we  find  the  requirement  dT/dh  > 9.4x10  ® K/m..  This  is  rather 
easily  satisfied  by  almost  any  temperature  inversion.  The  effects  of  anomalous  propagation  are 
commonly  heard  near  airports.  On  days  when  a low-lying  inversion  is  present,  the  jet  noise  may 
disturb  individuals  miles  away  from  the  airport.  Yet,  as  soon  as  the  inversion  layer  disappears,  the 
jet  noise  abates. 
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Now  let  us  consider  propagation  in  the  ocean  medium.  By  analogy  with  electromagnetic 
waves,  we  expect  that  a medium  exhibiting  a strong  minimum  in  propagation  velocity  (maximum 
of  refractive  index)  will  exhibit  ducting.  A typical  sound  speed  versus  depth  profile  in  the  deep 
ocean  is  shown  in  Figure  6-13.  We  are  not  disappointed  in  that  there  is  at  least  one  such  minimum 
(at  900  m depth  in  this  profile).  Figure  6-14  shows  ray  tracings  for  this  profile  at  two  acoustic 
emission  depths  (one  near  the  surface  and  one  near  the  point  of  minimum  sound  speed). 


Figure  6-13.  Typical  sound  speed  versus  depth  profile  for  the  deep  ocean.  [8] 
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Figure  6-14.  Ray  tracing  for  the  sound  speed  profile  of  Figure  6-13.  [8] 
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Several  phenomena  are  apparent  in  these  ray  tracings.  First,  rays  emitted  near  the  velocity 
minimum  oscillate  regularly  about  that  minimum.  Those  making  small  angles  with  the  axis  of  this 
channel  oscillate  very  nearly  sinusoidally.  Those  with  larger  angles  exhibit  a significant  vertical 
asymmetry  characteristic  of  the  different  gradients  above  and  below  this  channel.  This  propagation 
channel  is  called  the  “deep  sound  channel”  or  the  “SOFAR”  channel.  [9]  Second,  rays  emitted 
nearer  the  surface  are  also  ducted  by  the  channel.  However,  they  tend  to  stay  bunched  together. 
Those  rays  initially  headed  toward  the  surface  are  immediately  either  reflected  from  the  surface  or 
refracted  back  towards  the  downward  direction  and  then  travel  in  the  company  of  those  rays 
originally  emitted  downwards.  After  crossing  the  axis  of  the  deep  sound  channel,  the  entire  ray 
bundle  is  then  refracted  back  to  the  surface.  The  rays  reach  the  surface  at  a relatively  well-defined 
range  (65  km  in  this  example).  The  rays  are  then  either  refracted  or  reflected  back  towards  the 
depths  and  the  process  repeats.  Each  time  the  rays  come  to  the  surface,  they  have  spread  much  less 
than  1/4tiR^  would  predict.  The  spreading  is  still  proportional  to  1/R^  at  long  ranges,  but  the 
constant  of  proportionality  is  much  closer  to  3 than  to  IMtt.  The  regions  where  the  ray  bundles 
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reach  the  surface  are  called  convergence  zones  and  are  regions  of  considerably  enhanced  signal 
detectability.  Between  the  convergence  zones  are  shadow  zones  where  rays  from  the  indicated 
source  simply  do  not  propagate.  Thus,  not  only  are  the  convergence  zones  regions  of  enhanced 
detectability,  they  are  separated  by  regions  of  virtual  undetectability  of  the  signals.  The  deep  sound 
channel  also  has  shadow  zones  near  the  surface  and  exhibits  weak  convergence  zone  effects. 
However,  in  the  deep  sound  channel  at  sufficient  ranges,  the  shadow  zones  essentially  disappear. 

The  problem  to  detection  posed  by  shadow  zones  deserves  some  additional  comment. 
Acoustic  media  possess  the  property  of  reciprocity  (ray  paths  are  reciprocal).  That  is,  the  direction 
of  the  energy  flow  does  not  matter.  Thus,  if  the  noise  emitted  by  a submarine  experiences  a specific 
transmission  loss  on  a path  (whether  that  path  is  a direct  refracted  path,  a surface  reflection  path  or 
a bottom  reflection  path  as  indicated  in  Figure  6-15)  to  a sonobuoy,  then  the  reciprocal  path  from 
the  sonobuoy  to  the  submarine  must  experience  the  same  transmission  loss.  If  a sonobuoy  is  in  a 
shadow  zone  relative  to  the  noise  emitted  by  a submarine,  then  the  submarine  is  in  the  equivalent 
of  the  shadow  zone  of  any  active  emissions  from  the  sonobuoy.  Transmission  loss  on  a path  is  the 
same  regardless  of  whether  the  path  is  an  active  or  passive  path  or  the  direction  of  the  path.  That 
is  the  essence  of  reciprocity. 

Figure  6-15.  Reciprocity  in  acoustic  transmission  and  reception. 

PATH  LOSSES  BETWEEN  TWO  POINTS  ARE  INDEPENDENT 
OF  WHICH  IS  TRANSMITTING  AND  WHICH  IS  RECEIVING 


It  should  be  noted  in  every  instance  that  as  soon  as  the  acoustic  radiation  has  propagated 
more  than  a few  kilometers,  the  ubiquity  of  refractive  bending  renders  meaningless  any  angle  of 
arrival  information  in  the  vertical  direction.  That  is,  elevation  angle  cannot  by  itself  be  used  to 
determine  the  depth  of  the  target.  However,  in  an  active  sonar  system,  the  addition  of  range  data 
coupled  with  knowledge  of  the  refractivity  of  the  medium,  would  permit  accurate  estimation  of 
target  depth.  In  most  cases  azimuthal  angle  of  arrival  information  is  of  use  in  determining  target 
location.  However,  if  the  propagation  distance  is  long  enough,  it  may  traverse  a major  current  or 
river  outflow.  Since  these  are  notorious  for  having  significant  temperature  and/or  salinity  gradients, 
there  presence  can  cause  significant  refraction  in  the  azimuthal  direction.  This  would  cause  a 
degradation  in  the  location  utility  of  the  angle  of  arrival  information. 

To  this  point  our  arguments  have  been  qualitative.  It  would  be  useful  to  have  some  relations 
useful  for  predicting  underwater  refractive  behavior.  Let  us  begin  with  the  surface  layer.  Potential 
variability  in  salinity  and  temperature  with  depth  can  produce  gradients  of  almost  any  type.  The 
radius  of  curvature  expression,  Eq.(6.66),  can  be  rewritten  in  terms  of  the  sound  speed  and  depth, 


r = -rj 


dr]  I dh 


^ ( 

cos6^  = c 


1 


del  dz 


cosO , 


J 


(6.71) 


Figure  6-16. 


Effects  of  surface  layer  gradients  on  shallow  source  propagation. 
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where  now  a positive  radius  of  curvature  corresponds  to  rays  bending  towards  the  ocean  surface. 
In  the  upper  diagram  of  Figure  6-16  the  sound  speed  gradient  dc/dz  is  negative  and  linear.  The  rays 
emitted  roughly  parallel  to  the  surface  (cos  0 ~ 1)  will  have  a constant  radius  of  curvature  and  bend 
away  from  the  surface.  If  the  acoustic  emitter  is  at  a nonzero  depth  there  will  be  one  angle  of 
emission  at  which  the  ray  just  grazes  the  surface.  Rays  emitted  more  vertically  will  be  reflected 
back  in  the  downward  direction.  There  is  a region  in  which  no  rays  can  propagate  - a shadow  zone. 
If  the  emitter  is  at  depth  z^,  the  range  Rshadow  to  the  beginning  of  this  shadow  zone  is 


R 


SHADOW 


= [2rzs-zs)  = 


^c(zs)z 

dddz 


,1/2 


--Zc 


(6.72) 


If  the  sound  speed  gradient  is  positive  and  linear,  as  in  the  lower  diagram  in  Figure  6-16,  then  the 
rays  bend  toward  the  surface.  In  this  case  there  is  no  shadow  zone.  Each  ray  will  bend  toward  the 
surface  until  it  strikes  the  surface  and  reflects.  The  angle  of  reflection  is  equal  to  the  angle  of 
incidence.  The  reflected  ray  will  curve  back  towards  the  surface  until  the  ray  either  leaves  the  linear 
gradient  or  the  ray  strikes  the  surface  again. 

Because  the  sea  surface  is  not  flat,  but  is  modulated  by  a multiplicity  of  wave  patterns,  some 
of  the  radiation  reaching  the  surface  will  be  scattered  into  all  directions.  This  scattering  causes 
reverberation  (multiple  overlapping  echoes  that  act  as  a noise  source)  in  active  sonar  systems.  It 
also  attenuates  the  acoustic  signal.  It  has  been  empirically  determined  that  each  reflection  from  the 
sea  surface  imposes  a loss  (in  dB)  [10] 

b - 1.04  • (Sea  State)  • Vv  (6.73) 


where  v is  the  acoustic  frequency  in  kHz  and  (Sea  State)  is  the  sea  state  number.  Overall 
transmission  loss  is  increased  by  a term  proportional  to  the  number  of  surface  bounces  in  the  path. 

If  the  acoustic  radiation  encounters  the  sea  bottom  before  it  is  refracted  back  towards  the 
surface,  then  an  analogous  reflection  process  will  occur.  The  reflection  will  be  accompanied  by 
significant  scattering  and  transmission  loss.  Figure  6-17  shows  the  reflection  of  common  ocean 
sediment  as  a function  of  frequency  and  angle  of  incidence. [1 1]  The  situation  is  complicated  by 
considerable  variability  from  location  to  location  even  at  a single  frequency.  Figure  6-18  shows  a 
series  of  typical  loss  vs.  angle  profiles  for  the  1 -4  kHz  frequency  region.  [8]  Each  number  represents 
a different  bottom  type  (characterized  by  the  reflection  behavior  not  any  geological  or  oceanographic 
character).  Depth  sounding  charts  sometimes  indicate  the  bottom  type  as  represented  by  these  or 
a similar  set  of  curves.  As  with  surface  bounces,  the  total  path  attenuation  will  contain  a term 
proportional  to  the  number  of  bottom  bounces  encountered. 

In  many  instances  the  water  in  the  surface  layer  is  fairly  well  mixed.  This  mixed  state  should 
be  accepted  as  the  norm.  Temperature  and  salinity  gradients  are  the  exception.  If  there  is  no 
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Figure  6-17.  Bottom  reflection  loss  as  a function  of  frequency  and  angle  of  incidence.[l  1] 
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Figure  6-18.  Bottom  reflection  loss  over  1-4  kHz  frequency  range  for  different  bottom  types. [8] 
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temperature  or  salinity  gradient,  then  ignoring  higher  order  correetions,  from  Eq.(6.22)  we  determine 
the  speed  of  sound  gradient  to  be 

— « 0.0163  m ^ (6.74) 

dz 

If  we  assume  normal  salinity  (S=35  psu)  and  ignore  high  order  terms  in  Eq.(6.22),  we  ean  obtain  an 
approximate  relation  for  the  sound  speed 

c « 1449  + 53T  - 0.053r"  + 0.0 1 63z . (6.75) 

For  rough  estimates  we  may  simplify  this  further  by  picking  r=15  °C,  yielding 

c«1516  m/s  (6.76) 

which  is  accurate  to  ± 5%  over  the  typical  0 to  30  °C  range  of  temperatures. 

If  the  bottom  of  the  mixed  layer  is  at  depth  Dj  then  the  skip  distance,  the  distance  between 
reflections  off  the  surface  of  a ray  that  barely  grazes  the  bottom  of  the  mixed  layer  (as  identified  in 
Figure  6-19),  can  be  determined  from 

R,^{4D,(2r-D,)f"  (6.77) 

the  formula  for  the  length  of  a chord  of  a circle  of  radius  r and  chord  center  to  circle  distance  D;, 
where  the  radius  of  curvature  may  be  approximated  by 

r = — ^ = 93,000  m . (6.78) 

dcjdz 

These  expressions  reduce  to  [9] 

Rs  » » 8637a  ■ (6-79) 

The  speed  of  sound  profile  in  the  deep  sound  channel  is  not  parabolic  (for  one  thing  it  is 
strongly  asymmetric  about  the  minimum)  but  neither  is  it  truly  bi-linear  - the  combination  to  two 
linear  gradient  segments  - (although  the  lower  half  quickly  assumes  the  linear  gradient  associated 
with  the  deep  isothermal  layer.  Empirically  it  is  found  that  a bi-linear  approximation  is  much  better 
than  a parabolic  approximation  to  this  channel.  In  a bi-linear  approximation,  the  total  repeat 
distance  or  “wavelength”  will  be  the  sum  of  the  contributions  from  the  upper  linear  segment  and  the 
lower  linear  segment.  Thus,  the  upper  and  lower  gradients  may  be  averaged  without  error.  One 
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Figure  6-19.  Geometry  for  computation  convergence  zone  range  and  skip  distance. 
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estimator  of  the  gradient  is  to  take  the  difference  between  the  maximum  velocity  at  the  top  of 
the  channel  and  the  minimum  velocity  in  the  middle  of  the  channel  and  divide  by  half  of  the 
total  height  of  the  channel  (Z)^  -Dj).  Using  Eq.(6.79)  we  may  then  write  the  expression  for  the 
“wavelength”  as 

~ ^UPPER  ^LOWER 

^MAX  ~ ^MIN 

r V' 

= 242(D^-D,)  ^ 

V ^MAX  ~ ^MIN  J 

The  range  to  the  convergence  zone  is  the  sum  of  the  channel  “wavelength”  plus  any  effect  due  to 
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the  mixed  layer 


^cz  - + "> 


R. 


0.5i?c 


0 


- if  both  source  and  receiver  are  near 
the  top  of  the  mixed  layer 

- if  one  is  near  the  top  of  the  mixed 
layer  and  the  other  is  near  the  bottom 

- if  both  are  near  the  bottom  of  the  mixed 

layer  or  if  there  is  no  mixed  layer 


(6.81) 


The  equations  above  are  merely  rough  estimates.  Mueh  more  aeeurate  predietions  could  be  obtained 
by  actually  performing  the  ray  tracing  for  measured  velocity  profiles  (if  available)  using  the 
technique  described  in  Chapter  4. 
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Seismic  waves 


So  far  we  have  said  nothing  about  the  third  major  acoustic  propagation  medium  - solid 
ground.  Seismic  waves  do  play  a significant  role  in  some  military  operations.  For  example,  there 
have  been  numerous  attempts  to  gain  a tactical  advantage  by  tunneling  under  enemy  fortifications. 
The  “Crater”  of  the  Civil  War  siege  of  Petersburg  and  the  North  Korean  tunnels  under  the 
Demilitarized  Zone  are  two  significant  examples.  In  the  latter  case  seismic  waves  have  been 
exploited  to  monitor  such  activity.  Seismic  waves  have  also  been  exploited  to  remotely  detect 
vehicular  and  personnel  traffic. [12] 

In  a linear  elastic  homogeneous  solid,  the  acoustic  equations  are  obtained  from  Cauchy’s 
equation  of  motion  [13],[14] 


df  dXj 


(6.82) 


where  p is  the  normal  material  density,  is  the  i*  Cartesian  component  of  the  displacement 
from  their  normal  position  of  particles  normally  at  position  x,  the  quantities  <jy  are  the  components 
of  the  stress  tensor  (applied  forces  per  unit  area),  and  s^j  are  the  components  of  the  strain  tensor 
(macroscopic  changes  in  length  per  unit  length).  The  stress  is  related  to  the  strain  by 

3 

(6-83) 

k=\ 


with  the  strains  defined  by 


dx-  dx- 

J ^ 


(6.84) 


and  where  5y  is  the  Kronecker  delta  {dy  =1  if  i=j,  and  5y  = 0 if  z^f),  and  T and  p are  the  Lame 
constants.  These  constants  are  related  to  common  material  parameters  by  the  relations 


and 


;i  = 


vE 

(l  + r)(l-2r) 


p = G = 


E 

2(1+1/) 


(6.85) 


(6.86) 


where  E is  the  elastic  modulus  (or  Young’s  modulus  is  the  ratio  of  extensional  stress  to  the  resulting 
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extensional  strain  in  a cylinder  pulled  on  from  both  ends),  vis  Poisson’s  ratio  (the  ratio  of  lateral 
contraction  of  a cylinder  being  pulled  on  from  both  ends  to  its  longitudinal  extension),  and  G is  the 
shear  modulus  (the  ratio  of  shear  stress  to  the  resulting  shear  strain  - or  transverse  displacement  per 
unit  length).  E and  v are  seen  to  be  related  to  1 and  ju  by 


and 


^ ^ {3X  + 2^) jU 
A + jU 

A 

^ = r- 

2(/l  + ju) 


(6.87) 


(6.88) 


The  bulk  modulus  B (the  ratio  of  hydrostatic  pressure  to  the  resulting  volume  change  - a measure 
of  incompressibility)  is  related  to  A and  //  by 


3A  -h  2fj 
3 


(6.89) 


and  to  E and  vby 


B = 


E 

3(1-2//)' 


(6.90) 


Usually  at  least  two  of  the  quantities  E,  B,  G,  and/or  v can  be  found  for  a material.  The  other 
quantities  can  be  derived  by  suitable  manipulation  of  the  above  parameters. 

Substitution  of  the  stress-strain  relation  into  Cauchy’s  equation  yields  a “wave  equation” 

P^  = (A  + 2m-h)V{V-4)  + MV^-^  (6.91) 

If  the  displacements  are  irrotational  (Vx,^=  0)  or  purely  compressional  in  nature,  then  one  can  set 
^ = VO  (6.92) 

and  rewrite  Eq.  (6.91)  as 


v'o 


1 a^o 


= 0 


(6.93) 
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where 


2 _ X + 2/U  _ E[\-v) 


(6.94) 


is  the  compressional  (P)  wave  velocity. 

On  the  other  hand,  if  the  displacements  are  solenoidal  (V-<^=  0)  or  purely  shear  in  nature, 
then  one  can  set 


Vx^ 

and  rewrite  Eq.  (6.91)  as 


where 


1 


= 0 


.2  ^ ^ E 

p 2/?(l  + v) 


(6.95) 


(6.96) 


(6.97) 


is  the  shear  (S)  wave  velocity, 
equations. 


Both  Eqs.  (6.93)  and  (6.96)  have  the  form  of  classical  wave 


Earthquakes  are  the  source  of  the  strongest  seismic  waves.  Both  P and  S waves  are  produced 
by  earthquakes.  As  we  shall  see,  the  P waves  travel  faster  than  the  S waves.  The  sharp  jolt  that 
often  precedes  the  strong  shaking  of  an  earthquake  is  due  to  the  passage  of  the  P waves.  The  slower 
S waves  are  also  longer  lasting  and  more  destructive.  They  cause  much  of  the  side-to-side  and  up- 
and-down  motions  that  are  observed.  The  difference  in  S and  P wave  velocity  is  used  to  determine 
the  range  to  the  epicenter  of  the  earthquake  by  measuring  the  time  between  the  arrival  of  the  P and 
S waves.  The  distance  d the  earthquake  waves  have  traveled  is 


d = c,T,=cJ,=-^J^{T,-T,) 

Cp 


(6.98) 


where  Tp  is  the  time  between  the  earthquake  event  and  the  detection  of  the  P waves  and  Tg  is  the 
time  between  the  earthquake  event  and  the  detection  of  the  S waves. 


The  S and  P waves  are  bulk  (or  body)  waves  and  travel  through  the  mass  of  the  earth. 
Earthquakes  can  also  generate  surface  waves.  These  waves  occur  only  at  the  earth’s  surface. 
Motions  are  restricted  to  roughly  one  wavelength  or  less  in  depth.  There  are  two  common  forms  of 
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surface  waves:  Rayleigh  waves  and  Love  waves.  Love  waves  are  horizontal  shear  waves  that  only 
occur  in  areas  with  strong  horizontal  stratification  near  the  surface.  They  travel  more  slowly  than 
S waves  (velocity  approx.  2/3  Cg).  Rayleigh  waves  are  waves  that  combine  vertical  and  axial 
motions.  A point  on  the  surface  moves  up  and  down,  forward  and  backward  in  a circular  motion. 
Rayleigh  waves  have  the  slowest  propagation  velocity  (approx.  ‘A  c^),  last  the  longest,  and  often 
have  the  largest  amplitude  (even  larger  than  S waves).  Because  they  combine  vertical  andback-and- 
forth  motion,  they  often  cause  most  of  the  destruction  produced  by  an  earthquake.  Besides  their 
importance  in  earthquake  damage.  Love  and  Rayleigh  waves  are  important  because  they  are 
generally  not  produced  by  nuclear  explosions.  This  coupled  with  the  tendency  for  explosions  to 
produce  a relative  strong  P wave  makes  it  difficult  to  disguise  a nuclear  explosion  as  an  earthquake. 

Attenuation  mechanisms  in  solids  (especially  rocks  and  soil)  are  not  as  well  known  as  in 
fluids  and  gases.  Nevertheless  it  has  been  found  that  attenuation  can  be  described  by  an  exponential 
attenuation  coefficient  a.  The  peak-to-peak  amplitude  ^ of  a seismic  wave  decays  with  propagation 
distance  as 


A(L)^A(0)e  . (6.99) 

The  attenuation  coefficient  appears  to  increase  linearly  with  increasing  frequency,  just  like  the 
viscous  damping  contributions  to  the  acoustic  attenuation  coefficients  in  air  and  water.  For 
convenience,  the  attenuation  coefficients  are  often  listed  as 

a{f)^aj  (6.100) 

where  has  units  of  sec/meter  and /is  the  acoustic  frequency  in  Hz. 

In  Table  6-1  we  have  listed  the  speeds  of  sound  in  a number  of  rock  and  soil  types.  [15]  For 
comparison  the  table  also  includes  the  speeds  of  sound  for  a number  of  solid  materials  used  in  the 
fabrication  of  military  equipment.  [16]  Where  available,  both  shear  and  compression  wave  velocities 
are  included.  In  Table  6-2  the  acoustic  parameters  (sound  speeds  and  attenuation  coefficients)  of 
several  rock  samples  are  described.  [17]  Examination  of  this  last  table  shows  that  at  frequencies  of 
1 kHz  most  samples  have  attenuation  coefficients  in  excess  of  1 km  '.  Such  high  frequencies  will 
not  propagate  for  distances  more  than  a few  kilometers.  However,  at  frequencies  below  a few  Hz, 
the  attenuation  coefficients  are  of  the  order  of  0.0 1 km'  or  less.  Very  low  frequency  seismic  waves 
can  easily  propagate  around  the  world  without  excessive  attenuation. 
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Table  6-1.  Speeds  of  sound  in  various  solid  materials. [15], [16] 


Speed  of  Sound  (m/s) 


Material 

Density 

Ranae  of  values 

Comoression 

Shear 

Loose,  Dry  Soils 

- 

180-1000 

- 

- 

Clay 

- 

760-1900 

- 

- 

Wet  Soils 

- 

760-1900 

- 

- 

Compact  Soils 

- 

910-2600 

- 

- 

Sandstone 

- 

910-4300 

- 

- 

Shale 

— 

1800-5300 

— 

— 

Limestone/Chalk 

- 

2100-6400 

- 

- 

Metamorphic  Rock 

- 

3000-6400 

- 

- 

Volcanic  Rock 

— 

3000-6700 

— 

— 

Solid  Plutons 

— 

4000-7600 

— 

— 

Jointed  Granite 

— 

2400-4600 

— 

— 

Weathered  Rock 

— 

600-3100 

— 

— 

Aluminum 

2.7 

- 

6420 

3040 

Iron 

7.9 

— 

5950 

3240 

Stainless  Steel 

7.9 

- 

5790 

3100 

Titanium 

4.5 

— 

6070 

3125 

Fused  Silica 

2.2 

— 

5968 

3764 

Plexiglass 

1.18 

- 

2680 

1100 

Polyethylene 

0.90 

- 

1950 

540 

Table  6-2.  Characteristics  of  seismic  waves  in  selected  rocks. [17] 
In  the  loss  parameter  expressions /is  the  acoustic  frequency  in  Hz 


Speed  of  Sound  (M/s) 

Attenuation  Coefficient  (sec/m) 

Material 

Compression 

Shear 

Compression 

Shear 

Granite: 

Kamyk 

6100 

3500 

3.9x  10'^f 

— 

Sample  3 

5500 

3300 

2.7x  10'^f 

3.3x  lO'^f 

Sample  4 

4880 

2990 

1.4x  10'^f 

1.7x  lO'^f 

Limestone 

Solenhofen 

5560 

2900 

5.2  X 10'^f 

5.8x  lO'^f 

Sample  1-1 

6100 

3200 

3.1  X 10'^f 

2.5  X lO'^f 

Sandstone 

Sample  1 1 6 

5000 

— 

3.5  X 10'^  f 

— 

Chalk 

Chislehurst 

2300 

1240 

1 .0  X 1 0'^  f 

1.2  X 1 O'*  f 

Shale 

Pierre 

2200 

810 

4.5x  10'^f 

4 X 1 O'*  f 
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In  some  seismic  references,  attenuation  is  described  in  terms  of  a Q factor.  The  Q of  any 
oscillator  is  the  fractional  energy  loss  per  cycle  of  oscillation.  It  can  be  related  to  an  attenuation 
coefficient  by 


a^CoUcQ  (6.101) 

where  co  is  the  angular  frequency  of  the  radiation  and  c is  the  appropriate  sound  velocity. 

The  primary  vertical  changes  in  seismic  velocity  are  due  to  changes  in  rock  composition  with 
changes  in  depth.  However,  it  has  been  determined  that  except  for  discontinuities  caused  by 
composition  changes,  the  sound  velocity  slowly  increases  with  increasing  depth.  The  positive  sound 
speed  gradient  means  that  seismic  waves  will  bend  towards  the  surface  of  the  earth.  Table  6-3 
shows  the  sound  speed  characteristics  of  typical  oceanic  crust.  At  the  surface  there  is  a roughly  1 .5 
m/sec  per  meter  of  depth  gradient  in  the  P wave  velocity.  Recalling  Eq.  (6.71)  for  the  radius  of 
curvature  of  an  acoustic  wave  in  a linear  velocity  gradient,  we  calculate  a radius  of  curvature  of 
approximately  1.5  km.  Surface  ducting  is  almost  certain  to  occur  in  this  medium.  Note:  however, 
that  the  crust  under  the  ocean  is  almost  certainly  saturated  with  water  and  under  considerable 
hydrostatic  pressure.  Consequently,  this  may  not  be  a good  model  for  estimating  near-surface 
seismic  properties  on  dry  land.  Gradients  at  deeper  depths  are  typically  much  less  strong  (barring 
interfaces  between  different  materials)  and  radii  of  curvature  can  exceed  100  km  or  even  1000  km 
along  deep  subsurface  paths.  The  curvature  is  of  clear  significance  in  determining  the  distance  to 
distant  (lOO's  to  lOOO's  of  km)  seismic  events  (earthquakes  or  underground  nuclear  tests). 


Table  6-3.  MARMOD,  a generic  marine  seismic  model  of  the  oceanic  crust.[14] 


DEPTH  (km) 

(km/sec) 

Cs  (km/sec) 

P (fl/cm^) 

0.0 

4.50 

2.40 

2.0 

1.5 

6.80 

3.75 

2.8 

6.0 

7.00 

3.85 

2.9 

6.5 

8.00 

4.60 

3.1 

10.0 

8.10 

4.70 

3.1 

Linear  gradients  are  assumed  between  points. 
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Problems 


6-1.  What  are  the  speeds  of  sound  in  air  (at  sea  level)  and  in  water?  Give  a numerical  answer 
accurate  to  20%.  Note:  these  are  numbers  that  should  be  committed  to  memory. 

6-2.  What  is  the  speed  of  sound  in  helium  at  a temperature  of  300  K? 

6-3.  Plot  the  speed  of  sound  vs.  depth  for  the  following  data: 


Denthlm) 

TemneraturelC) 

Salinitv(nsu) 

0 

25 

34 

100 

25 

34 

200 

24 

34 

300 

20 

34 

400 

15 

34 

500 

10 

34 

600 

5 

34 

700 

2 

34 

800 

2 

34 

2000 

2 

34 

2100 

2 

38 

What  peculiar  propagation  characteristics  might  this  sound  speed  profile  produce? 

6-4.  Compare  and  contrast  the  frequency  behavior  of  acoustic  attenuation  in  air  and  in  water. 

6-5.  A source  has  a sound  pressure  level  of  80  dB  at  a frequency  of  1 kHz  at  a distance  of  1 m 

from  the  source.  Assuming  the  threshold  for  audibility  of  sound  at  1 kHz  is  20  dB  and 
spherical  spreading  of  the  sound  waves,  estimate  the  distance  in  air  at  which  this  source  can 
barely  be  heard.  Estimate  the  distance  in  water  at  which  this  same  source  can  barely  be 
heard. 

6-6.  How  does  acoustic  reflection  from  a material  interface  differ  from  electromagnetic  reflection 

from  a material  interface? 

6-7.  A source  and  a receiver  are  at  the  surface  of  a body  of  water.  The  water  has  a mixed  layer 
100  m deep  and  a deep  sound  channel  with  maximum  depth  of  2 1 00  m.  The  speed  of  sound 
is  1510  m/s  at  the  top  of  the  deep  sound  channel  and  1490  m/s  at  the  middle  of  the  channel. 
At  what  range  should  the  first  convergence  zone  be  expected? 

6-8.  A helicopter  drops  passive  sonobuoys  at  ranges  of  40,  80,  120,  160,  and  200  km  from  the 
source  in  problem  6-7.  Which  if  any  of  these  sonobuoys  are  likely  to  detect  the  source? 
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6-9.  An  underground  river  flows  into  the  ocean  from  a submerged  cliff  face  at  a depth  of  300  m. 
The  flow  rate  is  so  high  and  the  local  currents  are  such  that  an  unmixed  layer  of  fresh  water 
50  m thick  extends  from  the  river  mouth  out  to  a distance  of  200  km.  The  river  water 
temperature  is  10  °C  whereas  the  ocean  water  temperature  is  5°C.  Salinity  of  the  ocean 
water  is  35  psu.  Will  this  river  create  an  acoustic  ducting  layer  or  a reflecting  layer?  Justify 
your  answer  by  calculations. 

6-10.  Compare  and  contrast  the  propagation  of  electromagnetic  waves  in  the  atmosphere  with  the 
propagation  of  acoustic  waves  in  sea  water.  Qualitative  comparisons  will  suffice. 

6-11.  Aluminum  has  a density  of  2.6  x 10^  kg/m^,  an  elastic  modulus  of  70  GPa,  and  a shear 
modulus  of  28  GPa.  Estimate  the  compressional  and  shear  speeds  of  sound  in  aluminum. 

6-12.  A seismic  sensor  placed  in  a limestone  formation  detects  the  sound  of  digging  with  pickaxes. 
The  shear  wave  component  from  each  blow  of  the  pick  arrives  50  ms  after  the  compressional 
wave  component.  How  far  away  from  the  sensor  is  the  source  of  the  digging? 
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CHAPTER  7 


NUCLEAR  RADIATION:  ITS  ORIGIN  AND  PROPAGATION 


The  Nuclear  Radiation  Particle  “Zoo” 

Particle  physics  has  proven  the  existence  of  many  dozens  of  subatomic  or  nuclear  particles. 
Of  these  less  than  a dozen  will  be  encountered  in  common  military  situations.  We  describe  these 
critical  few  below.  The  particles  are  often  categorized  by  the  nature  of  their  physical  characteristics, 
the  ways  in  which  they  are  produced,  and  the  way  in  which  they  interact  with  other  particles.  These 
categorizations  are  also  discussed  below. 

Nuclear  particles  are  subject  to  two  or  more  of  four  fundamental  forces.  Gravity  is  the  force 
of  attraction  between  masses  (or  more  properly  between  volumes  of  mass-energy).  All  fundamental 
particles  are  subject  to  gravity.  However,  the  gravitational  force  is  so  weak  that  outside  of  collapsed 
stellar  objects  (white  dwarfs,  neutron  stars,  and  black  holes)  it  seldom  has  an  impact  on  nuclear 
processes.  The  electromagnetic  force  is  the  force  of  attraction  or  repulsion  between  charged 
particles.  Charged  nuclear  particles  are  subject  to  the  electromagnetic  force.  The  weak  force  is  a 
short-range  force  that  manifests  itself  only  in  radioactive  decay  processes.  Leptons  and  hadrons  are 
subject  to  the  weak  force.  The  strong  (or  nuclear)  force  is  a short-range  force  that  causes  baryons 
to  bind  together  and  form  nuclei.  It  also  governs  the  major  interactions  between  hadrons.  Only 
hadrons  are  subject  to  the  strong  force. 

A lepton  is  any  one  of  six  fundamental  particles  with  spin  Vi  which  are  capable  of  interacting 
via  the  weak  force  but  not  the  strong  (or  nuclear)  force.  The  six  leptons  are  electrons,  muons,  and 
tau  particles,  electron  neutrinos,  muon  neutrinos,  and  tau  neutrinos.  Leptons  have  a lepton  with  anti- 
leptons (the  anti-particles  of  leptons)  having  the  negative  lepton  number  of  the  corresponding  normal 
leptons.  Lepton  number  is  conserved  in  nuclear  reactions. 

A baryon  is  any  half-integer  spin  and  usually  very  massive  fundamental  particle  capable  of 
interacting  via  both  the  weak  and  strong  forces.  Baryons  have  a baryon  number  with  anti-baryons 
(the  anti-particles  of  baryons)  having  the  negative  baryon  number  of  the  corresponding  normal 
baryons.  Baryon  number  is  conserved  in  nuclear  reactions. 

A hadron  is  any  fundamental  particle  that  is  capable  of  interacting  via  the  strong  force.  All 
baryons  are  hadrons.  Mesons  are  integer-spin  hadrons.  Mesons  are  not  conserved  in  nuclear 
reactions. 

Electrons  are  light  (0.511  MeV)  stable  leptons  with  a -1  electrical  charge.  Electrons 
combine  with  nuclei  to  form  atoms.  They  may  be  ejected  from  atoms  by  nuclear  scattering 
processes.  They  are  produced  in  a common  form  of  radioactive  decay  called  beta-minus  decay 
leading  to  the  common  name  of  beta  (or  more  properly  beta-minus)  particles.  They  are  also 
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produced  together  with  positrons  are  produced  in  pair  production  absorption  of  gamma  rays. 

Positrons  are  the  anti-particles  of  electrons  having  -i-l  electrical  charge,  but  also  having  the 
same  mass  and  spin  as  electrons.  Positrons  are  emitted  in  beta-plus  radioactive  decay.  They  are  also 
produced  along  with  electrons  (one  electron  and  one  positron  for  each  absorbed  gamma  ray)  in  pair 
production  absorption  of  gamma  rays.  When  a positron  collides  with  an  electron,  the  two  particles 
may  annihilate  each  other  producing  two  gamma  rays.  Their  complete  masses  of  the  betas  are 
converted  into  energy  in  the  form  of  two  0.51 1 MeV  gamma  rays. 

Gamma  rays  are  photons  (or  quanta)  of  electromagnetic  radiation.  Gamma  rays  are  often 
produced  when  excited  nuclei  make  transitions  to  lower  energy  states.  When  electrons  in  excited 
atoms  make  similar  transitions  we  call  the  resulting  photons  x-rays.  Gamma  rays  typically  have 
energies  of  0.1-10  MeV  while  x-rays  typically  have  energies  of  1-100  keV. 

Protons  are  stable  baryons  of  moderate  mass  (938  MeV)  having  -i-l  electrical  charge  and 
spin  Vi.  Protons  are  one  of  the  primary  constituents  of  nuclei. 

Neutrons  are  electrically  neutral  baryons  of  moderate  mass  (940  MeV).  Neutrons  are  the 
second  major  constituent  of  nuclei.  Because  of  this,  neutrons  and  protons  are  collectively  referred 
to  as  nucleons.  Free  neutrons  decay  to  protons  and  electrons  with  a half-life  of  930  seconds. 
Neutrons  bound  in  nuclei  are  stable.  Free  neutrons  are  produced  in  many  particle  interactions. 

Neutrinos  are  massless,  electrically  neutral  leptons  produced  in  weak  interactions  such  as 
beta  decay.  They  are  produced  to  balance  the  lepton  number  in  the  beta  decay  processes.  If  an 
electron  (beta-minus)  is  produced  in  a decay  process,  an  anti-electron  neutrino  must  be  produced 
to  balance  the  lepton  number.  If  a muon  decays  into  an  electron,  a muon  neutrino  and  an  anti- 
electron neutrino  must  be  produced  to  balance  the  lepton  number.  Practically  speaking,  the 
interaction  probability  of  neutrinos  with  normal  matter  is  so  low  that  they  effectively  do  not  interact. 
An  average  neutrino  can  pass  straight  the  center  of  the  earth  without  interacting. 

Alpha  particles  are  bare  nuclei  of  Helium-4.  They  consist  of  two  protons  & two  neutrons 
bound  together  by  the  strong  force.  The  heavy  (3.727  GeV),  charge  +2,  spin  0 particles  are 
extraordinarily  stable.  They  are  emitted  in  the  alpha  decay  of  many  radioactive  nuclei  and  in  a 
variety  of  particle-nucleus  interactions. 

Muons  are  heavy  (106  MeV)  leptons  produced  in  the  interaction  of  cosmic  rays  with  nuclei 
in  the  upper  atmosphere.  They  are  also  produced  in  a variety  of  high-energy  particle  interactions. 
Muons  decay  to  electrons  (and  two  neutrinos)  with  a half-life  of  2.2  psec. 

Deuterons  are  bare  nuclei  of  deuterium  (Hydrogen-2).  They  contain  1 proton  & 1 neutron. 
Tritons  are  bare  nuclei  of  tritium  (Hydrogen-3)  and  contain  1 proton  & 2 neutrons.  Helium-3 
nuclei  contain  2 protons  & 1 neutrons.  All  three  are  occasionally  emitted  in  radio-active  decay. 
They  are  also  produced  in  many  particle-nucleus  interactions.  The  parent  gases  (deuterium,  tritium, 
and  Helium-3)  are  available  in  sufficient  quantities  that  the  ions  can  be  accelerated  in  particle 
accelerators  to  produce  different  artificial  isotopes  for  industrial  and  medical  purposes. 
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The  particles  described  above  and  their  properties  are  summarized  in  Table  7-1  below. 


Table  7-1.  Commonly  encountered  nuclear  particles. 


PARTICLE 

INTERACTIONS? 

ELEC 

AAASS 

LEPTON 

BARYON 

LIFETIME 

NAME 

EM 

WK 

SI 

CHG 

(MeV) 

SPIN 

NUMBER 

NUMBER 

(sec) 

GAMAAA  (y) 

Y 

N 

N 

0 

0 

1 

0 

0 

Stable 

ELECTRON  (e  ,p  ) 

Y 

Y 

N 

-1 

0.51099906 

V2 

le 

0 

Stable 

POSITRON  (e^p^) 

Y 

Y 

N 

+ 1 

0.51099906 

V2 

-le 

0 

"Stable" 

PROTON  (p) 

Y 

Y 

Y 

+ 1 

938.27231 

V2 

0 

1 

Stable 

ANTI  PROTON  (p  ) 

Y 

Y 

Y 

-1 

938.27231 

V2 

0 

-1 

"Stable" 

NEUTRON  (n) 

N 

Y 

Y 

0 

939.56563 

V2 

0 

1 

930 

e NEUTRINO 

N 

Y 

N 

0 

0 

V2 

± 1 e 

0 

Stable 

p NEUTRINO 

N 

Y 

N 

0 

0 

V2 

±lp 

0 

Stable 

MUON  (p) 

Y 

Y 

N 

±1 

105.658389 

V2 

±lp 

0 

2.198X  10 

ALPHA  (a) 

Y 

Y 

Y 

+2 

3727.3799 

0 

0 

4 

Stable 

DEUTERON  (d) 

Y 

Y 

Y 

+ 1 

1875.61339 

1 

0 

2 

Stable 

TRITON  (t) 

Y 

Y 

Y 

+ 1 

2808.5561 

V2 

0 

3 

3.875  X 10' 

HELIUM-3  fHe) 

Y 

Y 

Y 

+ 1 

2808.5375 

V2 

0 

3 

Stable 

PION  (tc) 

N 

N 

Y 

0 

134.975 

1 

0 

0 

00 

X 

o 

Y 

N 

Y 

±1 

139.579 

1 

0 

0 

2.604  X 10- 

A nucleus  (the  plural  is  nuclei)  is  a collection  of  protons  and  neutrons  bound  together  by  the 
strong  (nuclear)  force  in  a more  or  less  stable  arrangement.  Unstable  nuclei  transform  themselves 
into  more  stable  nuclei  via  processes  collectively  known  as  radioactive  decay.  Positively-charged 
nuclei  are  massive  (many  GeV)  and  may  possess  either  integer  or  half-integer  spin  depending  on  the 
numbers  of  protons  and  neutrons  they  contain. 

Nuclei  at  rest  tend  to  attract  electrons  because  of  their  positive  charge.  Nuclei  which  have 
attracted  a number  of  electrons  equal  to  the  number  of  protons,  and  thus  have  become  electrically 
neutral,  are  called  atoms.  The  chemical  properties  of  an  atom  is  determined  by  the  number  of 
electrons  it  has  (and  thus  the  number  of  protons  in  its  nucleus).  Over  the  years  the  chemical 
community  was  able  to  identify  a moderate  number  of  distinct  chemical  elements,  atoms  which 
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react  in  unique  and  predictable  ways.  With  the  development  of  the  “nuclear”  theory  of  atoms  we 
are  able  to  organize  the  chemical  elements  into  a periodic  table  that  is  based  on  atomic  number 
(Z  equal  to  the  number  of  protons  in  each  nucleus).  Figure  7-1  shows  the  author’s  version  of  the 
periodic  table.  This  version  has  been  updated  to  include  the  latest  data  as  of  roughly  1 January  2000. 
Elements  beyond  Z=109  have  not  yet  been  officially  named.  Many  individuals  use  an  interim 
nomenclature  based  on  the  value  of  Z.  Thus  Z=1 10  is  ununnilium  (un=l,  un=l,  nil=0,  -i-ium)  with 
corresponding  symbol  Uun.  Z=lll  is  unununium  (Uuu),  Z=112  is  ununbium  (Uub),  Z=113  is 
ununtrium  (Uut),  etc.  The  naming  of  newly  discovered  elements  evokes  strong  passions  in  the 
participants.  The  interim  nomenclature  is  sufficiently  unbiased,  no  individual  name  will  ever  be  a 
candidate  for  permanent  adoption,  and  can  automatically  accommodate  all  new  discoveries,  so  its 
continued  use  for  new  elements  is  virtually  assured. 

Each  element  has  a different  unique  atomic  number.  The  number  of  neutrons  (N)  in  a 
nucleus  of  any  given  element  is  not  fixed.  Different  neutron  numbers  lead  to  different  total  masses 
of  a nucleus.  The  mass  number  (A)  of  a nucleus  is  the  sum  of  its  atomic  number  (number  of 
protons)  and  its  neutron  number.  Components  of  the  same  element  with  different  masses  (different 
neutron  numbers  and  mass  numbers)  are  called  isotopes.  Radioisotopes  are  isotopes  that  are 
unstable  and  subject  to  radioactive  decay.  The  atomic  weight  of  an  isotope  is  roughly  equal  to  its 
mass  number.  The  atomic  weight  of  an  element  is  an  average  of  the  atomic  weights  of  each  isotope 
weighted  by  the  fractional  natural  abundance  of  those  isotopes.  The  term  nuclide  is  used  to 
distinguish  unique  species  with  different  atomic  numbers  and  neutron  numbers.  Any  possible 
nucleus  can  be  represented  as  a distinct  nuclide.  Radionuclides  are  nuclides  that  are  unstable  and 
subject  to  radioactive  decay. 

The  difference  between  the  mass  number  and  the  atomic  weight  is  due  to  three  factors.  One 
is  the  fact  that  atomic  weights  for  atoms  contain  the  weights  associated  with  the  bound  electrons 
(roughly  0.05%  of  the  total  mass).  Second,  when  nucleons  are  brought  into  close  contact  to  form 
a nucleus,  their  net  energy  is  less  than  the  energy  of  the  isolated  particles.  This  energy  difference 
is  called  the  binding  energy.  As  we  shall  see  in  a later  section,  the  binding  energy  is  a complicated 
function  of  the  number  of  neutrons  and  protons.  Since  mass  and  energy  are  equivalent,  the  binding 
energy  reduces  the  mass  of  the  nucleus  below  the  mass  that  would  result  from  summing  the  masses 
of  the  separated  neutrons  and  protons.  Third,  atomic  weights  are  normalized  to  the  atomic  weight 
of  Carbon-12  (A=12,  Z=6)  with  that  weight  being  defined  as  12.000  •••.  Thus,  a proton  has  an 

atomic  weight  of  1.00727648,  a neutron  has  an  atomic  weight  of  1.00866491,  an  electron  has  a 
atomic  weight  of  0.00054858,  and  a hydrogen  atom  (proton  -i-  electron)  has  an  atomic  weight  of 
1.00782504. 

Nuclear  binding  energies  are  useful  in  a variety  of  ways.  They  can  be  used  in  calculations 
of  energy  production/absorption  in  nuclear  reactions.  They  can  be  used  to  assist  in  predicting  the 
kinds  of  reactions  that  a nucleus  may  undergo.  A common  form  of  expression  is  the  binding  energy 
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Fiaure  7-1.  Periodic  Table  of  Uie  Elements 
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per  nucleon  which  is  the  approximate  energy  that  must  be  supplied  to  remove  a single  nucleon  from 
a nucleus.  In  practice  there  are  differences  between  removing  a proton  or  a neutron  and  there  are 
differences  related  to  the  stability  of  the  resulting  nucleus.  However,  binding  energy  per  nucleon 
is  useful  for  thinking  purposes. 

The  mass  defect  is  the  difference  in  mass  between  the  isolated  particles  that  might  make  up 
a specific  nucleus  and  the  mass  of  that  nucleus  [1]: 


AM  = Z(m^  + m J 

= Zm^  + Nm^  - 


(7.1) 


where  the  masses  are  usually  expressed  in  atomic  mass  units  (amu)  based  on  the  mass  of  Carbon- 12. 
The  total  energy  difference  is  the  product  of  AM  and  the  energy  of  1 amu  (=  931.4943  MeV) 


A£  = 931.4943(Zm„  + Nm„  - 

= 931.4943(1.00782504Z  + 1.00866491)V  - 


(7.2) 


This  energy  is  equal  to  the  total  binding  energy,  thus  the  binding  energy  per  nucleon  is 

^E  r 93 1.4943^ 


V 


J 


(1.00782504Z  + 1.00866491V  - . (7.3) 


For  example,  consider  Helium-4,  with  =4.00260323 , Z=2,  N=2,  and  A=4,  then 


^ r 93 1.4943 
A 


V 


1(1.00782504(2)  + 1.00866491(2)  - 4.00260323) 


4 ; 

= 7.07392  MeV  / nucleon 


Several  other  noteworthy  examples  are  deuterium  (Hydrogen-2)  with  M„^,^i^^^=2.01410178,  Z=l, 
N=l,  and  A=2, 


AE 


1^93 1.4943 


V 


(1.00782504(1)  + 1.00866491(1)  - 2.01410178) 


= 1.11228  MeV /nucleon 


Uranium-235  with  M„^^, ^^=235.043924,  Z=92,  N=143,  and  A =255, 
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^E 


( 931.4943 


V 


235  J 


1(1.00782504(92)  + 1.00866491(143)  - 235.043924) 


= 7.59093  MeV  / nucleon 


and  Iron-56  with  M , =55.934940,  Z=26,  N=30,  and  A=56, 


AE 


f 931.4943 


^ 56  J 


1(1.00782504(26)  + 1.00866491(30)  - 55.934940) 


= 8.79028  MeV  / nucleon 


A complete  presentation  of  the  results  of  Eq.  (7.3)  is  shown  in  Figure  7-2  which  shows  the 
binding  energy  per  nucleon  for  the  longest  lived  nuclides  at  each  mass  number.  The  curve  is 
relatively  smooth  with  a few  prominent  bumps.  The  most  noticeable  bump  is  at  Helium-4.  The  very 
high  binding  energy  means  that  alpha  particles  are  among  the  most  stable  of  all  light  particles.  It 
is  for  this  reason  that  they  show  up  as  radioactive  decay  products.  It  is  more  energetically  favorable 
for  many  nuclei  to  emit  an  alpha  particle  than  it  is  for  them  to  emit  a proton,  neutron,  deuteron, 
triton,  or  other  similar  particle.  The  peak  of  the  binding  energy  curve  occurs  near  the  value  of  A=56. 
Iron-56  is  among  the  most  tightly  bound  of  all  nuclei. 


Figure  7-2.  The  Curve  of  Binding  Energy.[l] 


245 


The  curve  of  binding  energy  is  most  enlightening  when  studied  in  detail,  predicting  the 
energetic  favorability  of  nuclear  fission  and  the  iron/nickel/cobalt  endpoint  of  stellar  fusion 
processes.  We  will  return  to  this  curve  in  later  discussions. 
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Nuclear  Particle  Reactions 


A nuclear  particle  reaction  is  any  process  in  which  an  incident  nuclear  particle  (electron, 
gamma  ray,  neutron,  etc.)  interacts  with  either  a target  nuclear  particle  or,  more  likely,  with  a target 
atom  or  nucleus  to  cause  a change  in  one  or  all  of  the  participants.  Reactions  may  produce  entirely 
different  particles,  they  may  cause  particles  to  disappear,  or  they  may  cause  an  internal  change  in 
energy  of  one  or  more  of  the  participants.  A number  of  the  more  commonly  experienced  nuclear 
reactions  are  identified  and  defined  below. 

The  strength  of  many  of  these  reactions  are  described  by  reaction  cross  sections.  The  cross 
section  (cr)  for  a nuclear  reaction  is  the  effective  area  (cm^)  that  an  atom,  nucleus,  or  particle 
presents  to  an  incident  particle  with  respect  to  “colliding  and  undergoing  a specific  nuclear 
reaction”.  Cross  sections  for  nuclear  reactions  are  usually  very  small.  A barn  is  a unit  of  cross 
section  equal  to  1 x 10‘^"^  cm^,  that  has  been  found  to  be  particular  relevant  to  nuclear  reactions.  A 
differential  cross  section  is  a distribution  of  cross  section  versus  some  parameter,  e.g.,  scattering 
angle.  The  units  of  an  angular  differential  cross  section  are  cmVsr  vs.  6. 

Elastic  scattering  is  a scattering  process  in  which  the  incident  particle  preserves  its  identity 
throughout  the  scattering  event,  and  in  which  significant  momentum  but  limited  energy  is  transferred 
from  the  incident  particle  to  the  target  nucleus.  The  target  nucleus  is  not  excited  to  any  higher 
internal  energy  in  an  elastic  scattering  process.  The  actual  energy  lost  is  a function  of  the  angle  at 
which  the  particle  is  scattered.  The  ratio  of  scattered  particle  energy  to  incident  particle  energy  is 
given  by  the  relation  [1] 

^ A^  + 2AMcos0+M^- 

^incident  (A+M) 

where  M is  the  mass  of  the  incident  particle,  A is  the  mass  of  the  target  nucleus,  and  6 is  the  angle 
of  scattering  relative  to  the  incident  direction.  The  maximum  energy  loss  occurs  for  6 = 180°  and 
is  given  by 


A^+2AM(-1)  + M^  {A-Mf 

Emcidem  mIN  (A+M)  (A+M) 

Although  we  have  stated  that  the  energy  loss  is  limited,  this  is  really  only  true  for  heavy  target 
nuclei.  If  A=100M,  then  the  minimum  ratio  of  scattered  energy  to  incident  energy  is  0.96. 
However,  if  we  are  interested  in  elastic  neutron  scattering  from  hydrogen  atoms,  then  A~M  and  the 
energy  lost  can  range  from  zero  for  small-angle  scattering  to  virtually  all  of  the  initial  energy  for 
backwards  scattering.  The  large  elastic  scattering  energy  losses  in  hydrogen  make  it  of  interest  in 
neutron  moderation  and  neutron  shielding  applications. 
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Elastic  scattering  of  gamma  rays  is  called  Rayleigh  scattering  or  coherent  scattering.  The 
energy  of  the  scattered  gamma  is  essentially  unchanged.  Coherent  scattering  is  only  important  at 
gamma  energies  below  roughly  10  keV.  Even  in  this  low  energy  regime  coherent  scattering  is 
usually  very  small  compared  to  photoelectric  absorption.  Electron  binding  energies  are  usually  so 
small  that  inelastic  scattering  processes  are  energetically  favorable  and  overshadow  coherent 
scattering. 

Inelastic  scattering  is  a scattering  process  in  which  the  incident  particle  preserves  its 
identity  throughout  the  scattering  event,  and  in  which  significant  energy  and  momentum  is 
transferred  from  the  incident  particle  to  the  target  nucleus.  The  target  nucleus  is  left  in  an  excited 
energy  state,  although  the  excitation  energy  may  be  rapidly  radiated  away  as  gamma  rays. 

Compton  scattering  is  the  inelastic  scattering  of  gamma  rays  from  electrons  bound  in  atoms. 
The  gamma  ray  scatters  with  significant  energy  loss;  the  lost  energy  is  transferred  to  an  electron 
which  gains  sufficient  energy  to  be  ejected  from  the  target  atom.  The  energy  of  the  scattered  gamma 
hv’  is  related  to  the  energy  of  the  incident  gamma  hvhy  [2] 


hv' 


hv 

\ + {hv  I mc^y^X  - cos  6) 


(7.6) 


where  m is  the  mass  of  an  electron,  ^is  the  angle  through  which  the  gamma  ray  is  scattered,  and  v 
is  the  frequency  of  the  incident  photon.  The  recoil  electron  has  energy  given  by 


‘RECOIL 


= h{v-v')-B.E. 


(7.7) 


where  B.E.  is  the  binding  energy  of  the  electron  (usually  negligible).  The  differential  cross  section 
for  scattering  through  an  angle  6 is  given  by  the  Klein-Nishina  formula 


d(7  /.  2 

in  cm 


/ sr)  = 


OO  22  24 

iZTT  SqM  C 


y>  \ 

— I Sin  6 


V 

yyj 


yv 


(7.8) 


j 


Integrating  this  function  over  all  angles  0 gives  the  total  Compton  scattering  cross  section.  The 
recoil  electron  deposits  its  energy  in  the  general  vicinity  of  its  production.  The  scattered  gamma  ray 
usually  leaves  the  vicinity  before  reacting  further.  The  fraction  of  the  total  energy  that  is  deposited 
in  the  vicinity  of  a Compton  event  is  sometimes  called  Compton  absorption.  The  Compton 
absorption  coefficient  is  the  product  of  the  total  Compton  scattering  attenuation  coefficient  and  the 
fraction  of  the  energy  absorbed.  The  Compton  scattering  coefficient  is  the  fraction  of  the  energy 
carried  away  by  the  scattered  gamma  times  the  total  Compton  scattering  attenuation  coefficient. 
This  will  become  clearer  after  reading  the  next  section  on  cross  sections  and  attenuation  coefficients. 

The  photoelectric  effect  is  a process  in  which  x-rays  or  gamma  rays  are  absorbed  by 
electrons  bound  in  atoms.  The  absorption  process  transfers  sufficient  energy  to  an  electron  for  it  to 
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be  ejected  from  the  target  atom.  The  energy  of  the  ejected  electron  is  equal  to  the  incident  gamma 
energy  /?v  minus  the  binding  energy  B.E.  of  the  electron  in  its  atomic  state. 


E 


PHOTOELECTRON 


- hv  - B.E. 


(7.9) 


In  metals  and  covalent  solids,  the  electrons  are  not  bound  to  individual  atoms  but  are  distributed 
throughout  the  solid.  In  this  case  the  binding  energy  should  be  replaced  by  the  work  function  of  the 
solid.  At  low  gamma  ray  photon  energies  the  atomic  binding  energies  of  the  electrons  play  a 
significant  role  in  the  shape  of  the  photoelectric  cross  section  curves.  In  general  the  probability  of 
photoelectric  scattering  is  proportional  to  [3] 


3 h v{h  V - 


(7.10) 


where  Z is  the  nuclear  charge,  is  the  classical  electron  radius,  a is  the  fine  structure  constant,  and 
^•is  the  binding  energy  of  the  electron  to  be  ejected. 

Pair  production  is  a process  in  which  a gamma  ray  collides  with  any  particle  and  disappears 
with  the  simultaneous  creation  of  an  electron  and  a positron.  Only  gamma  rays  with  energies  in 
excess  of  1 .022  MeV  are  capable  of  pair  production.  The  electron  and  the  positron  each  possess  an 
initial  kinetic  energy  that  is  equal  to  one-half  the  difference  between  the  photon  energy  h vand  1 .022 
MeV  (the  rest  mass-energy  of  an  electron  and  a positron) 

(/iv- 1.022  MeV).  (7.11) 

The  cross  section  for  pair  production  is  not  easily  calculated.  However,  at  energies  greater  than  the 
1 .022  MeV  threshold  but  below  extreme  relativistic  energies,  the  cross  section  will  take  the  form 
[4] 


a = aZ^r^ 


^ hv-2mc^  ^ 
hv 


F(v) 


(7.12) 


where  F{v)  is  a complicated  function  that  varies  from  about  2 at  low  energies  to  around  10  at 
energies  around  100  MeV. 

All  charged  particles  can  interact  with  materials  through  the  processes  of  ionization  and 
atomic  excitation.  The  electric  field  of  the  charged  particle  interacts  with  an  electron  in  the 
material  imparting  a small  energy  to  it.  This  energy  may  be  sufficient  to  cause  the  electron  to  be 
ejected  (ionization)  or  to  be  elevated  to  a higher  internal  energy  level  (atomic  excitation).  Both 
processes  are  most  important  when  the  charged  particle  energy  is  low,  but  obviously  cannot  occur 
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if  the  charged  particle  energy  is  less  than  the  excitation  or  ionization  energy.  The  cross  section  for 
atomic  excitation  is  given  approximately  by  [5] 


\nza^ldc 


^ n^m 


1 |2  , 

r In 

1 nm  \ 

kAEj 

(7.13) 


where  E is  the  kinetic  energy  of  the  charged  particle,  z is  the  charge  of  the  incident  particle,  fj,  is  the 
reduced  mass  of  the  incident  particle,  \r^\^  is  the  electric  dipole  matrix  element  connecting  the  initial 
atomic  state  n to  final  atomic  state  m,  and  A£"  is  the  n-^m  transition  energy  or  the  ionization  energy, 
as  appropriate.  The  reduced  mass  of  a particle  of  mass  interacting  with  a particle  of  mass  M2  is 
given  by 


//  = — 

Ml  + M2 


(7.14) 


The  cross  section  for  ionization  is  essentially  the  same  as  Eq.  (7.13)  except  that  the  bound  state- 
bound  state  transition  matrix  element  is  replaced  by  a bound  state-free  electron  matrix  element.  The 
energy  dependence  will  be  essentially  the  same. 

The  qualitative  behavior  of  excitation/ionization  is  shown  in  Figure  7-3.  There  is  a threshold 
at  £■  = AE,  with  a steep  rise  to  a maximum,  followed  by  a slow  fall  that  is  roughly  proportional  to 
HE.  The  appearance  of  a threshold  is  required  by  energy  conservation  (an  atom  cannot  be  excited 
to  a state  AE  higher  than  its  original  state  unless  the  incident  photon  has  an  energy  of  at  least  A£). 
Eq.  (7.13)  is  not  quantitatively  valid  in  the  region  below  and  just  above  the  maximum.  The  cross 
section  expression  was  developed  under  an  assumption  of  moderate  electron  energies.  Note  that  the 
term  /nf^E/AE)  implies  a threshold  at  E = AE/4,  which  is  not  physically  allowable.  Eq.  (7.13) 
should  give  reasonable  results  for  electron  energies  above  a few  hundred  electron  volts. 

The  magnitude  of  the  total  excitation  and  or  ionization  would  be  determined  by  integrating 
over  all  possible  transitions.  The  total  cross  sections  can  be  estimated  by  replacing  the  matrix 
elements  in  Eq.(7. 13)  with  a quantity  rja^  where  Uq  is  the  Bohr  radius  and  7 is  a number  of  the  order 
of  unity  which  is  theoretically  derivable  from  atomic  transition  sum  rules  for  the  target  atom. [6], [7] 
The  values  of  rj  for  excitation  and  ionization  will  not  necessarily  be  the  same.  We  can  estimate  what 
values  of  the  peak  cross  section  may  be  expected  for  electrons.  Since  fic^  = 511  keV,  if  we  assume 
E = 100  eV,  AE  = 10  eV  and  rj  ~1,  then  the  result  is  <t~  3.5  x 10'^®  cm^.  For  heavier  charged 
particles  the  cross  sections  will  scale  as  and  approximately  as  m.  We  would  expect  alpha  particle 
cross  sections  to  be  roughly  3 x 10"^  larger  than  the  corresponding  electron  cross  sections. 

Induced  fission  is  a process  in  which  an  incident  particle  (usually  a neutron,  but  possibly 
another  kind  of  particle)  is  absorbed  by  a nucleus,  so  disrupting  the  nucleus  that  it  breaks  into  two 
smaller  nuclei  with  the  simultaneous  emission  of  a number  of  other  particles  (usually  neutrons). 
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Figure  7-3.  Energy  dependence  of  the  atomic  excitation  or  ionization  cross  section. 


Transmutation  is  a collection  of  reaction  processes  in  which  an  incident  particle  is  absorbed 
by  a target  nucleus,  with  or  without  another  nuclear  particle  being  ejected,  and  producing  a different 
nucleus  as  a product.  The  product  nucleus  may  or  may  not  be  in  an  excited  internal  energy  state. 
Transmutation  reactions  include:  photonuclear  processes  in  which  a gamma  ray  is  absorbed  by  a 
nucleus  and  one  or  more  neutrons  or  protons  or  both  are  ejected,  capture  reactions  in  which  the 
incident  particle  is  captured  with  no  particles  other  than  gamma  rays  being  emitted,  and  particle- 
particle  exchange  reactions  in  which  one  particle  is  absorbed  by  the  target  nucleus  and  one  or  more 
other  particles  are  ejected. 

Nuclei  and  many  nuclear  particles  may  be  described  by  the  shorthand  notation 

Arji  N 

where  Z is  the  atomic  number  (number  of  protons),  N is  the  number  of  neutrons,  A = N+Z  is  the 
mass  number,  and  T is  the  chemical  symbol  for  the  element  with  atomic  number  Z.  For  complete- 
ness in  describing  most  reactions  of  interest  we  may  assume  the  following  definitions: 


qII*  = neutron 
Jp°  = proton  =JH° 
= gamma  ray 
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A typical  nuclear  transmutation  reaction  can  be  described  by  the  equation 


A\r^^N\ 


Z1 


T1 


Target 


Nucleus  j 


+ ^373"' 


^ Incident^ 


Particle  j 


Z4^  ^ 


^ Exiting  ^ 


Particle  j 


■ A2rjir\N2 

Z2^  ^ 


^ Product  ^ 


y Nucleus  J 


(7.15) 


with  associated  conservation  laws  for  N,  A,  and  Z being  expressed  as 

NI+  N3^  N4  + N2 

A1+  A3=  A4  + A2 
Z1  + Z3  = Z4  + Z2 


(7.16a) 

(7.16b) 

(7.16c) 


Often  reaction  (7.15)  will  be  written  using  the  shorthand 

2lrP‘(X,F)f72"'"  (7.17) 

where  X is  the  incident  particle  and  Y is  the  exiting  particle. 


Figure  7-4  lists  the  product  nuclei  relative  to  a standard  target  nucleus  for  most  of  the 
common  transmutation  reactions.  [8]  The  shorthand  described  above  is  used  extensively  in  this 
figure.  Figure  7-4  also  describes  the  daughter  nuclei  for  a number  of  radioactive  decay  modes.  As 
an  illustration  of  the  use  of  this  figure,  an  a capture  reaction  (uppermost  righthand  corner  of  chart) 
produces  the  product  nucleus  with 


A+4  'X'  O N +2 

Z+2  ^ ^ 


a capture 


while  either  a (n,2n),  a (y,n),  or  a (p,pn)  reaction  (left-hand  middle  of  chart)  will  result  in  a product 
nucleus  with 


2 r " ^ (n,2n),  (y,n),  or  (p,pn). 

Note  that  (p,pn)  denotes  that  one  proton  is  incident  on  the  target  while  both  a proton  and  a neutron 
are  emitted.  Alpha  decay  (lower  left-hand  comer  of  chart)  will  result  in  a product  nucleus  with 

A-4  rj.  ^ N-2  , 

2_2  1—2  a decay. 
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Figure  7-4.  Common  transmutation  reaction  products  and  radioactive  decay  products. [8] 
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Not  all  of  the  reactions  described  above  occur  for  all  particles  and  not  all  occur  at  all  energies 
of  those  particles.  In  Table  7-2  we  have  summarized  the  dominant  reactions  of  the  more  common 
nuclear  particles  and  the  energy  ranges  at  which  those  reactions  are  usually  important.  The  reactions 
may  or  may  not  be  possible  outside  of  those  ranges.  As  discussed  above,  many  reactions  have 
thresholds  below  which  they  cannot  occur. 
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Table  7-2.  Dominant  reactions  of  nuclear  particles 


PARTICLE 

DOMINANT  REACTIONS 

ENERGY 

PHOTON 

ATOMIC  EXCITATION 
RAYLEIGH  SCATTERING 
PHOTOELECTRIC  ABSORPTION 
COMPTON  SCATTERING 
PAIR  PRODUCTION 
TRANSMUTATION  - e.g.  (y,n),  (y,p) 

MeV  AND  BELOW 
eVAND  BELOW 
TOO  keVAND  BELOW 
keV  AND  ABOVE 
1 MeV  AND  ABOVE 
MeV  AND  ABOVE 

ELECTRON 

ATOMIC  EXCITATION 
ELECTRO-IONIZATION 
PAIR  PRODUCTION 
TRANSMUTATION  - e.g.  (e,n),  (e,p) 

lOeV  AND  ABOVE 
keV  AND  ABOVE 
MeV  AND  ABOVE 
MeV  AND  ABOVE 

PROTON 

ATOMIC  EXCITATION 
IONIZATION 
ELASTIC  SCATTERING 
INELASTIC  SCATTERING 
TRANSMUTATION  - e.g.  (p,n),  (p,a) 

10  eV  AND  ABOVE 
keV  AND  ABOVE 
keV  AND  ABOVE 
MeV  AND  ABOVE 
MeV  AND  ABOVE 

NEUTRON 

ELASTIC  SCATTERING 
INELASTIC  SCATTERING 
ABSORPTION  (TRANSMUTATION) 
FISSION  (HEAVY  ELEMENTS  ONLY) 

ALL  ENERGIES 
MeV  AND  ABOVE 
ALL  ENERGIES 
MeV  AND  ABOVE 
(except  fissile  nuclei) 

ALPHA 

ATOMIC  EXCITATION 
IONIZATION 
ELASTIC  SCATTERING 
INELASTIC  SCATTERING 
TRANSMUTATION 

10  eV  AND  ABOVE 
keV  AND  ABOVE 
keV  AND  ABOVE 
MeV  AND  ABOVE 
MeV  AND  ABOVE 
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Nuclear  Particle  Reaction  Rates 


Before  attempting  to  describe  nuclear  reaction  rates  a few  definitions  are  in  order.  The 
number  of  particles  passing  through  a unit  area  per  unit  time  is  called  the  flux  density. 

Flux  density  = (f>=  particles/sec/cm^  (7.18) 

Fluence  is  the  time  integral  of  flux  density.  Therefore,  fluence  is  the  total  number  of  particles  which 
have  passed  through  a unit  area. 

Fluence  = particles/cm^  (7.19) 

The  reaction  rate  relative  to  a single  target  atom  is  the  product  of  the  flux  density  and  the 
cross  section 


= <y‘t’  ■ (7.20) 

The  reaction  rate  per  unit  volume  of  target  material  is  the  product  of  the  target  atomic  density  (AO 
in  atoms/cm^  times  the  reaction  rate  per  atom 

K,u..  = Ncyil>.  (7.21) 

The  reaction  rate  per  unit  area  occurring  in  a small  distance  element  dx  is  the  volume  reaction  rate 
times  the  distance  element 


= Natf)  dx  = -d^ . 


{121) 


This  same  rate  is  the  rate  at  which  particles  are  lost  from  an  incident  beam.  That  is,  it  is  equal  to  the 
change  in  the  flux  density.  Rearrangement  of  this  equation  gives  the  following  differential  equation 
for  the  flux  density 


d(j) 

dx 


-No-^ 


(7.23) 


This  kind  of  differential  equation  can  be  easily  solved  by  rearranging  to  form  the  logarithmic 
derivative  and  integrating.  That  is 


and 


d(j) 

T 


-d\n(/)-  -N(7  dx 


N(j  dx . 


(7.24) 


(7.25) 
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This  has  solutions 


and 


In  ^{x)  - In  ^(O)  - -Nax 

<P{x)  = . 


(7.26) 

(7.27) 


Thus  the  flux  density  is  seen  to  decrease  exponentially  with  propagation  distance  into  the  target. 
This  type  of  behavior  is  characteristic  of  many  kinds  of  attenuation  processes. 

The  characteristic  distance  over  which  the  flux  density  decreases  is  the  1/e  length.  That  is, 
we  set  the  exponential  equal  to  1/e  and  solve  for  the  distance: 

^-iVax  ^ 1/^  ^ ^-1  ^^  28) 


with  logarithm 


- Nax  = - 1 (7.29) 

and  subsequent  solution 

x = llNa.  (7.30) 

The  1/e  characteristic  length  is  commonly  called  the  attenuation  (or  absorption)  length.  From  Eq. 
(7.30)  we  see  that  the  attenuation  length  is  inversely  proportional  to  the  density  of  the  target  and 
inversely  proportional  to  the  reaction  cross  section. 
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Attenuation  of  Nuclear  Radiation 

The  macroscopic  cross  section  Z is  the  product  of  the  target  atomic  density  (atoms/cm^)  and 
the  cross  section  per  atom.  It  is  equivalent  to  the  attenuation  length. 

(7.31) 

The  macroscopic  cross  section  is  also  called  the  linear  attenuation  coefficient  p.  Both  quantities 
have  units  of  cm  '.  The  reciprocal  of  the  linear  attenuation  coefficient  is  the  average  distance  a 
particle  travels  in  a material  before  reacting.  The  mass  attenuation  coefficient  (p/p)  is  the  linear 
attenuation  coefficient  (p  in  l/cm)divided  by  the  mass  density  (p  in  g/cm^)  of  the  material.  It 
represents  the  total  reaction  cross  section  per  unit  mass  of  target  material.  It  has  units  of  cmVg. 


— =Nalp  (7.32) 

P 

Equations  (7.8)  and  (7.12)  have  alternate  forms  which  use  the  mass  attenuation  coefficient. 
The  alternate  forms  are 


and 


d(!) 

dx 


kP) 


p(!> 


(j){pc)  - (j){jS)e  . 


(7.33) 


(7.34) 


Just  as  we  saw  in  the  case  of  electromagnetic  radiation,  attenuation  (or  extinction)  of  nuclear 
radiation  has  two  parts:  absorption  and  scattering.  Scattering  takes  radiation  moving  in  one  (usually 
fixed)  direction  and  converts  it  into  radiation  moving  in  another  (usually  random)  direction. 
Scattering  removes  radiation  from  a beam  of  radiation  but  does  not  eliminate  it  from  the  total 
environment.  Absorption  converts  radiation  into  absorbed  energy.  Radiation  is  removed  not  only 
from  a beam  but  from  the  total  environment.  The  removed  energy  is  absorbed  within  the  “absorb- 
ing” material  and  converted  to  internal  atomic  and  nuclear  excitations  and  ultimately  heat.  The  total 
attenuation  coefficient  can  be  written  as  the  sum  of  the  scattering  and  absorption  components: 
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(7.35) 


(7.36) 


Some  interaction  processes  will  contribute  to  absorption,  some  contribute  to  scattering,  and 
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some  contribute  to  both  (an  inelastic  scattering  process  scatters  some  energy  out  of  the  beam,  while 
some  of  the  energy  is  deposited  in  the  target  medium).  For  example,  Figure  7-5  shows  the  processes 
important  to  gamma  ray  scattering  in  aluminum  in  the  energy  range  10  keV  to  100  MeV.  The  total 
(mass)  attenuation  coefficient  is  the  sum  of  contributions  from  photoelectric  absorption,  Compton 
scattering  and  Compton  absorption,  and  pair  production.  The  total  (mass)  absorption  coefficient 
lacks  any  contribution  from  Compton  scattering  but  does  have  a contribution  from  the  other  three 
processes.  [9] 


Figure  7-5.  Gamma  ray  interactions  in  aluminum  and  the  mass  absorption 
and  mass  attenuation  coefficients. [9] 
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Computer  codes  exist  which  are  capable  of  generating  data  on  the  total  as  well  as  individual 
contributions  to  the  attenuation  coefficient.  [10]  The  National  Institute  of  Standards  and  Technology 
has  published  (on  the  Internet)  tabular  and  graphical  data  on  the  mass  attenuation  and  mass  energy 
absorption  coefficients  for  all  of  the  elements  and  a large  number  of  materials  of  interest  in  radiation 
shielding,  radiation  measurement,  and  radiation  health  physics.  [1 1]  Figure  7-6  compares  the  mass 
attenuation  coefficient  and  the  mass  energy  absorption  coefficient  for  three  common  materials  (dry 
air,  water,  and  iron).  Note  the  influence  of  the  K-edge  (onset  of  photoelectric  emission  of  K-shell 
electrons)  on  the  iron  coefficients.  The  K-edges  of  elements  in  air  and  water  occur  at  photon 
energies  below  those  plotted  in  the  graphs. 

If  one  is  interested  in  determining  the  flux  of  radiation  passing  through  a surface  after  an 
initial  flux  has  passed  through  a mass  of  attenuating  material,  neither  the  attenuation  coefficient  nor 
the  absorption  coefficient  will  give  the  right  answer.  Use  of  the  absorption  coefficient  will 
underestimate  the  “attenuation”  of  the  radiation  flux,  but  use  of  the  attenuation  coefficient  will 
neglect  the  effect  of  small  angle  scattering  and  multiple  scattering.  A single  scattering  will  cause 
a particle  to  be  directed  out  of  the  beam,  but  additional  scatterings  may  cause  it  to  be  directed  back 
in  the  general  direction 


Figure  7-6.  Comparison  of  the  total  gamma  ray  mass  attenuation  coefficient  and 
the  energy  absorption  coefficient  for  selected  materials  [11]: 


a)  dry  air. 


Photon  Energy  (MeV) 


259 


Figure  7-6  (continued).  Comparison  of  the  total  gamma  ray  mass  attenuation  coefficient 
and  the  energy  absorption  coefficient  for  selected  materials: 


b)  water. 


c)  iron. 


Photon  Energy  (MeV) 


Photon  Energy  (MeV) 
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of  the  measurement  surface.  If  the  particle  is  only  scattered  through  a small  angle,  it  may  not  move 
laterally  by  enough  distance  to  miss  the  measurement  surface.  This  results  in  a “buildup”  factor  B 
such  that 


(j)i^x)- (/){jS)B{r,rj)e  (7.37) 

where  5 >1  is  a function  of  the  distance  r from  the  scattering  region  and  the  transverse  distance  Vj 
away  from  the  center  of  the  beam.  The  farther  the  measurement  point  is  from  the  attenuating  layer, 
the  less  important  either  of  these  scattering  effects  becomes  and  B approaches  unity.  The  interested 
reader  is  referred  to  texts  on  reactor  engineering  [1]  and/or  radiation  shielding  for  detailed 
discussions  of  determining  buildup  factors. 

The  primary  neutron  reactions  at  energies  below  10  MeV  are  elastic  scattering,  absorption 
(transmutation  to  a new  isotope  usually  followed  by  gamma  emission),  and  fission.  Because  the 
neutron  only  interacts  via  the  short-range  nuclear  force,  most  neutron  interactions  are  many-body 
problems  and  theoretical  expressions  for  cross  sections  are  usually  suspect.  However,  the  experi- 
mental nuclear  physics  community  has  compiled  empirical  data  on  virtually  every  element,  isotope, 
and  reaction  of  interest.  There  are  a number  of  data  compilations  foremost  among  which  may  be 
the  Evaluated  Neutron  Data  Files  (ENDF).  Today  these  are  readily  accessible  via  the  Internet  in 
either  tabular  or  graphical  form.  [12] 

Figure  7-7  shows  the  dominant  neutron  cross  sections  for  reactions  with  Aluminum-27  nuclei 
as  generated  by  ENDFPFOT  (one  of  the  on-line  graphical  routines).  [12]  Elastic  scattering  and 
absorption  (with  subsequent  gamma  emission)  are  strong  at  all  neutron  energies  between  thermal 
energy  (0.025  eV)  through  10  MeV.  Inelastic  scattering  (leaving  the  Al-27  nucleus  in  its  first 
excited  state  rapidly  becomes  significant  above  a threshold  of  about  850  keV.  The  (n,ctr)  transmuta- 
tion reaction  begins  to  become  significant  at  neutron  energies  above  5 MeV.  Figure  7-8  shows  the 
neutron  cross  sections  for  the  fissile  nucleus  Uranium-235.  Being  fissile,  the  fission  cross  section 
is  greater  than  the  absorption  cross  section  at  all  energies.  The  extremely  large  number  of 
resonances  (narrow  local  peaks  in  the  cross  section)  is  evident  in  both  the  fission  and  absorption 
cross  sections.  The  elastic  scattering  cross  section  becomes  the  dominant  cross  section  at  energies 
above  a few  keV.  No  other  neutrons  reactions  have  any  appreciable  cross  section  below  10  MeV. 

An  individual  gamma  ray  or  neutron  tends  to  deposits  much  of  its  energy  at  individual  points 
in  a medium.  That  is,  when  a gamma  ray  is  absorbed  by  the  photoelectric  effect,  all  of  its  energy 
is  transferred  to  the  photoelectron.  Charged  particles  tend  to  deposit  their  energy  gradually  as  they 
move  through  a medium.  The  linear  energy  transfer  (LET)  of  a charged  particle  is  the  amount  of 
energy  deposited  locally  in  a medium  per  unit  length  of  travel.  The  LET  may  be  determined  from 
the  ionization  and  excitation  cross  section  expressions. 
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The  rate  of  excitation  of  atoms  on  the  transition  n^m  is  given  by 


dN 


m 


dt 


\n^m 


j dE  N„  a, ^, „{£)<!>{£) 


(7.38) 


where  0(E)  is  the  charged  particle  flux  as  a function  of  kinetic  energy.  0(E)  will  take  different 
forms  depending  on  the  source  and  the  collisional  history  of  the  charged  particles.  The  rate  of 
ionization  will  look  similar. 

For  example,  consider  a single  non-relativistic,  high-energy  electron  (energy  »Eg» 
AE  for  all  possible  transitions).  The  average  rate  at  which  the  electron  collides  inelastically  with  a 
collection  of  atoms  of  number  density  p (cm‘^^  is 

Rate  = pGV  (7.39) 


where  v is  the  electron  velocity  and  cr  is  the  cross  section  integrated  over  all  possible  transitions 


n TOTAL  m 


J dE  ^(E) 


(7.40) 


If,  on  the  average,  the  electron  loses  energy  E in  each  collision,  whether  by  excitation  or  ionization, 
the  time  rate  of  energy  loss  is  given  by 


dE 

dt 


= - Epcrv 


(7.41) 


Since  the  electron  moves  at  velocity  v,  the  spatial  variation  of  the  energy  loss  is 


dE  _ dE 
dx  d {vt) 


= - Epa 


(7.42) 


Because  each  is  inversely  proportional  to  E at  high  energies  (we  will  ignore  the  In  E term  in 
the  cross  section),  cr  must  also  be  inversely  proportional  to  E.  Letting  a — <J  ^ ! E ,^nc  find 


dE  - _ 

= -Epa ^ / E (7.43) 

dx 


Upon  integration  we  obtain 
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(7.44) 


E[x)  - [eI  - lEpa^x^ 
or 

dE _ -Epa  ^ 

(El-2Epa,xp 

The  spatial  dependence  of  the  energy  deposition  is  shown  in  Figure  7-9.  The  dashed  curve  shows 
the  result  of  Eq.  (7.45),  while  the  solid  curve  is  indicative  of  the  behavior  in  a real  system.  Nature 
usually  figures  out  a way  to  eliminate  singularities.  In  this  case  the  discrepancy  between  the  real 
and  calculated  results  is  due  to  the  combined  effects  of  neglected  processes  including  elastic  scat- 
tering, electroionization,  and  the  probabilistic  nature  of  the  scattering  process.  In  any  case,  the 
energy  loss  rate  (equivalent  to  the  total  excitation  rate)  is  relatively  constant  at  first,  then  rises 
quickly  to  a mazimum  near  the  value 

Xmax  = Ell2Ep-^^  (7.46) 

and  then  falls  rapidly  to  zero.  The  peak  exhibited  by  Figure  7-9  is  commonly  called  the  Bragg  peak. 
The  result  Eq.  (7.46)  is  obtained  by  setting  E(x)=0m  Eq.  (7.44)  and  solving  for  x.  If  we  begin  with 
a nearly  monoenergetic  beam  of  high  energy  electrons,  such  as  may  be  produced  by  a particle 
accelerator  or  high  voltage  electron  beam  source,  then  for  much  of  the  way  into  the  medium  on  the 
way  towards  OfF)  will  remain  nearly  monoenergetic  with  the  energy  being  given  by  Eq.  (7.44). 

As  the  beam  nears  it  will  lose  its  monoenergetic  character,  gradually  becoming  thermal  in 
character  somewhat  past  x^^^.  The  LET  of  the  charged  particle  is  given  by  Eq.  (7.45).  The  range 
of  the  charged  particle  in  the  material  is  given  by  Eq.  (7.46). 
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f pix.E/E 


Figure  7-9.  Energy  deposition  by  a charged  particle. 


Radiation  Exposure  and  Dose 

Exposure  is  the  amount  radiation  present  at  the  site  of  an  individual  or  piece  of  equipment. 
We  measure  exposure  in  terms  of  Roentgens.  One  Roentgen  (R)  produces  2.08  x 10^  ion  pairs  (an 
ion  pair  is  an  electron  and  a positive  charged  ion)  per  cm^  of  dry  air  or  a total  deposited  charge  of 
2.58  X lO  "^  Coulombs  per  kg  of  dry  air. 

I R-  2.08  X 10^  ion  pairs  / cm^  = 2.58  x 10“"^  C / kg  (dry  air)  (7.47) 

There  is  no  SI  unit  of  exposure.  Technically,  Roentgen  units  apply  only  to  x-rays  or  gamma 
radiation.  However,  this  does  not  prevent  them  from  being  used  for  other  forms  of  ion-producing 
radiations.  Exposure  is  an  indication  of  the  potential  for  hazardous  conditions  to  exist.  A radiation 
source  of  a certain  specified  strength  will  always  produce  the  same  certain  exposure  when  present 
for  a specified  period  of  time  at  a specified  distance  from  the  volume  of  interest.  Because  time  is 
of  interest  in  an  exposure,  a more  common  source  description  is  in  Roentgens  per  hour  (R/hr). 
When  objects  or  personnel  are  given  an  exposure  to  a certain  amount  of  radiation,  they  absorb  a 
fraction  of  the  total  energy.  What  doesn’t  get  absorbed  cannot  produce  an  effect.  The  amount  of 
radiation  absorbed  per  unit  volume  is  called  the  dose.  Exposure  is  a measure  of  what  effects  might 
be,  dose  is  a measure  of  what  the  effects  will  likely  be.  Dose  rate  is  the  dose  accumulated  per  unit 
time. 


Dose  is  measured  in  SI  units  of  Grays.  One  Gray  (Gy)  is  1 J of  absorbed  energy  per 
kilogram  of  material.  A older  unit,  the  rad,  is  still  used  and  is  equal  to  100  ergs  absorbed  per  gram 
of  material.  One  Gray  equals  100  rads.  For  x-rays,  gamma  radiation,  and  beta  radiation,  it  has  be 
found  that  an  exposure  of  1 Roentgen  will  produce  a dose  of  1 rad  (1  cGy)  in  a person.  Exposure 
to  so  many  Roentgens  or  a dose  of  so  many  rads  will  produce  a specific  level  of  effect.  However, 
if  the  radiation  is  something  other  than  gamma  or  beta  radiation,  it  has  been  found  that  different 
levels  of  effects  are  observed.  Indeed,  it  has  been  found  that  fast  neutrons  are  far  more  destructive 
than  gamma  rays.  This  is  due  to  the  heavy  mass  of  the  neutron  imparting  considerable  recoil  energy 
(relative  to  gamma  rays)  to  nuclei  in  the  vicinity  of  its  interaction.  Recoiling  nuclei  can  break 
chemical  bonds  and  cause  substantial  cellular  damage.  For  this  reason  exposure  to  one  “Roentgen” 
of  neutron  radiation  has  the  same  biological  effect  as  exposure  to  perhaps  ten  Roentgens  of  x-rays. 
This  has  led  to  the  development  of  the  concept  of  dose  equivalent.  If  absorbed  dose  is  measured 
in  rads,  then  the  dose  equivalent  is  measured  in  rems  (Roentgen  equivalent  man)  where 

Dose  Equivalent  (rems)  = Absorbed  Dose  (rads)  x RBE  (7.48a) 

or 

Dose  Equivalent  (Sieverts)  = Absorbed  Dose  (Grays)  x RBE  (7.48b) 

where  RBE  is  the  relative  biological  effectiveness.  Many  texts  now  use  the  term  “quality  factor 
(QF)”  instead  of  RBE,  but  the  meaning  is  the  same  same.  The  SI  unit  of  dose  equivalent  is  the 
Sievert  (Sv).  In  general,  the  RBE  is  a function  of  the  particle  type  and  the  particle  energy.  RBE 
values  can  be  estimated  from  Table  7-3.  [13] 
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Table  7-3.  Relative  Biological  Effectiveness  (RBE)  or  Quality  Factors  (QF)  for  radiation.  [13] 


PARTICLE  TYPE  CONDITIONS  RBE  or  QF 


X-Rays,  Gamma  Rays 

All 

1 

Electrons,  Positrons 

All 

1 

Alpha  Particles  or 

LET  Values 

< 35 

2 

Heavy  Charged 

in  MeV/cm 

35 

2 

Particles 

in  Water: 

70 

4 

230 

10 

530 

20 

1750 

40 

> 1750 

40 

Neutrons 

Energy  in  MeV: 

Thermal 

5 

0.0001 

4 

0.01 

5 

0.1 

15 

0.5 

22 

1.0 

22 

10 

13 

The  absorbed  dose  can  be  calculated  from  the  expression 


1 r/O  _ ( 

p dx  \ P j 


(7.49) 


where  O is  the  energy  fluence  (J/cm^).  The  quantity  dQ)/dx  is  the  energy  deposited  per  unit  volume. 
Division  by  the  density  p converts  this  to  the  energy  deposited  per  unit  mass.  The  energy  fluence 
can  be  estimated  from  the  particle  flux  density  by  multiplying  by  the  average  particle  energy  and 
the  exposure  time.  A more  accurate  value  of  absorbed  dose  can  be  obtained  by  integrating  the 
product  of  the  particle  energy  times  the  mass  absorption  coefficient  (as  a function  of  energy)  times 
the  flux  density  as  a function  of  energy  times  the  exposure  time.  That  is. 


Absorbed  Dose  = - 


Pj 


eT(!){e) 


(7.50) 


where  E is  the  particle  energy,  T is  the  exposure  time,  and  <j)(E)  is  the  particle  flux  per  unit  area  per 
unit  time  per  unit  energy  interval. 
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Radioactive  Decay 

Many  of  the  atomic  nuclei  we  have  observed  on  earth  are  stable.  That  is,  they  have  existed 
in  their  current  form  without  change  since  they  were  formed  in  a supernova  explosion  long  before 
the  formation  of  our  solar  system.  However,  many  nuclei  are  unstable.  They  have  excess  energy 
relative  to  some  different  but  achievable  configuration  of  protons  and  neutrons.  Unstable  nuclei 
transform  themselves  into  more  stable  nuclei  through  any  one  of  a number  of  processes  collectively 
referred  to  as  radioactive  decay. 

Stable  nuclei  have  characteristics  that  are  moderately  well  defined.  As  illustrated  in  Figure 
7-10,  they  tend  to  have  a number  of  neutrons  that  is  slightly  larger  than  the  number  of  protons.  The 


Figure  7-10.  The  relationship  between  neutron  number  and  proton  number  in  stable  nuclei.  [1] 
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trend  illustrated  in  Figure  7-10  continues  well  beyond  the  last  stable  nucleus  shown.  If  a curve  were 
drawn  through  the  stable  nuclei  and  extrapolated  to  Z=120  or  so,  then  the  radioactive  nuclei  lying 
nearer  this  extrapolated  curve  would  have  longer  half-lives  than  those  lying  considerable  distances 
from  the  curve.  For  example,  Thorium-232  (Z=90,  N=142)  lies  exactly  where  the  extrapolated  curve 
would  intercept  the  upper  end  of  Figure  7-5.  It  has  a half-life  of  1 .4  x lO'*^  years,  long  enough  that 
even  though  Th-232  is  radioactive,  much  of  it  has  still  not  decayed  in  the  period  since  its  creation 
(long  before  the  birth  of  the  solar  system).  Curium- 247  (Z=96,  N=151)  has  a half-life  of  1.56  x 10’ 
years  and  lies  near  the  extrapolated  line.  If  we  were  to  plot  the  binding  energy  per  nucleon  as  a 
function  of  Z and  N,  we  would  find  that  the  maximum  binding  energies  would  superimpose  almost 
perfectly  on  this  stability  chart.  Since  binding  energies  are  actually  negative  energies,  the  curve  is 
sometimes  referred  to  as  the  “valley  of  stability”.  Several  other  empirical  observations  are  worth 
mentioning.  Nuclei  with  even  numbers  of  protons  and  even  numbers  of  neutrons  tend  to  be  more 
stable  than  nuclei  with  odd  numbers  of  neutrons  and  protons.  This  appears  to  result  from  pairing 
of  spins.  For  example,  a spin  up  proton  can  pair  up  with  a spin  down  proton  to  produce  a lower 
energy  state.  Furthermore,  there  are  certain  values  of  N and  Z that  seem  to  have  extra  stability. 
These  values  are  usually  called  “magic  numbers”  and  are  given  by 

NorZ  = 2,8,  20,  28,  50,  82,  126,  and  184. 

At  these  values  of  N and  Z theoretical  models  (so-called  “shell  models”  of  nuclear  structure  predict 
the  formation  of  closed  shells  with  complete  pairing  between  all  nuclear  particles  and  all  quantum 
numbers. 

Alpha  decay  is  the  spontaneous  emission  of  an  alpha  particle.  The  daughter  nucleus  has  2 
fewer  neutrons,  2 fewer  protons,  and  an  atomic  weight  4 less  than  the  parent  nucleus.  The  decay 
reaction  may  be  represented  as 

^ Decay  > ^-4  _ 2 (7.51) 

Alpha  decay  is  rare  in  nuclides  with  Z<82  (lead).  It  is  common  near  the  “valley  of  stability”  for 
elements  with  Z>82.  Given  the  shape  of  the  binding  energy  curve,  one  can  mathematically  ascertain 
that  alpha  decay  is  not  energetically  possible  for  ground  state  nuclei  with  A less  than  some  number. 
Empirically,  this  value  appears  to  be  roughly  A =200  which  corresponds  roughly  to  Z = 82. 

Beta-minus  (P‘)  decay  results  from  the  conversion  of  a neutron  into  a proton  with  the 
subsequent  production  and  emission  of  an  electron  and  an  electron  antineutrino.  The  decay  reaction 
is 

A- Decay  ^ T + l + &~  + (7.52) 

Beta-minus  decay  is  common  in  neutron-rich  nuclides  (those  lying  on  the  neutron-abundant  side  of 
the  valley  of  stability). 

Beta-plus  (P"^)  decay  results  from  the  conversion  of  a proton  into  a neutron  with  the 
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subsequent  production  and  emission  of  a positron  and  an  electron  neutrino.  The  decay  reaction  is 


Arrf N Decay  . A r-p  -i  N+l  , co\ 

zl  — z-1  ^ 1 ^ (7.53) 

Beta-plus  decay  is  common  in  neutron-deficient  nuclides  (those  lying  on  the  neutron-poor  side  of 
the  valley  of  stability). 

Electron  capture  is  the  conversion  of  a proton  into  a neutron  by  absorbing  and  inner  shell 
electron  and  emitting  a neutrino.  The  decay  reaction  is 

^r^-fe~(K  Shell)  Electron  capture  ^ ^ A y,  _ j 54^ 


Electron  capture  is  common  in  neutron-deficient  nuclides.  It  frequently  competes  with  beta-plus 
decay  for  decays  in  the  same  nuclide. 

Spontaneous  fission  is  the  splitting  (in  the  absence  of  any  initiating  particle)  of  a nucleus 
into  two  smaller  nuclei  plus  the  emission  of  a small  number  (typically  2 or  3)  free  neutrons.  The 
decay  reaction  is 


Arp  N 


Spontaneous  Fission  ^ Al 


» + 


Zl 


A-Al-X 

Z-Zl 


T2 


N-Nl-X 


y 


(7.55) 


With  the  exception  of  Beryllium-8  which  fissions  instantaneously  into  two  alpha  particles, 
spontaneous  fission  is  not  known  to  occur  below  thorium  Z=90.  It  becomes  more  and  more  likely 
as  Z increases.  Although  it  competes  with  alpha  decay,  spontaneous  fission  usually  is  the  much 
weaker  process. 


Deuteron,  triton,  and  Helium-3  emission  are  rare  modes  of  decay  in  which  the  parent 
nucleus  emits  a deuterium,  tritium,  or  Helium-3  nucleus  instead  of  an  alpha  particle.  The  decay 
reactions  are 


ArpN  Deuteron  Emission  A-2 

^ Z-1 


ArpN  Triton  Emission 

Z^ 


and 


Arp  N Helium-3  Emission  A-3 
Z^  ^ Z-2 


r-1  + (d) 

(7.56) 

-1  + (t) 

(7.57) 

r-2  -f 

(7.58) 

Isomeric  transition  is  the  decay  of  a metastable  (very-long-lived)  excited  nuclear  state  to 
the  ground  state  by  emission  of  a gamma  ray.  The  decay  reaction  is 


Arp  N 


Isomeric  Transition  Arp  N 


gnd 


+ y 


(7.59) 
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Isomeric  transitions  differ  from  other  decay  schemes  in  that  its  decay  product  is  the  same  nuclide 
as  the  parent.  The  product  atom  is  usually  electrically  neutral. 

Internal  conversion  is  the  decay  of  an  excited  nuclear  state  by  ejection  of  an  inner  shell 
electron  rather  than  emission  of  a gamma  ray.  The  decay  reaction  is 


ArpN 


+ e“  (K  Shell)  Conversion 


+ e (free) 


(7.60) 


Internal  conversion  differs  from  isomeric  transition  in  that  the  product  atom  is  an  electrically 
positive  ion.  Neither  changes  the  basic  nuclide. 

Details  on  the  characteristics  of  isotope  can  be  found  in  several  sources.  Summary  data  on 
decay  modes  and  half-lives  can  be  found  in  the  Table  of  the  Nuclides.  [8],  [14]  This  document  exists 
in  wall  chart,  pamphlet  [8],  and  electronic  form  [14].  Details  on  nuclear  energy  levels,  decay  modes 
including  secondary  particle  emissions,  and  half-lives,  can  be  found  in  the  Table  of  Isotopes.  [15] 

Activity  is  the  radioactive  disintegration  rate.  The  SI  unit  of  activity  is  the  Becquerel  (Bq) 
which  is  equal  to  one  (1)  disintegration  per  second.  An  older  unit  of  activity  that  is  still  in  common 
use  is  the  Curie  (Ci).  One  Curie  is  equal  to  3.7  x 10^°  disintegrations  per  second. 

1 Ci  = 3.7  X 10'°  Bq 

Unstable  (radioactive)  species  decay  to  other  unstable  species  until  ultimately  one  of  the  decay 
products  is  stable.  This  is  called  a radioactive  decay  chain. 

Let  us  examine  the  hypothetical  decay  chain  shown  in  Figure  7-11.  Nuclide  0 is  the  species 
formed  initially  in  some  production  system,  or  is  naturally  occurring.  It  is  referred  to  as  the  parent 
nuclide  of  the  decay  chain.  The  parent  decays  with  a characteristic  time  constant  Tq  to  nuclide  1. 
Nuclide  1 is  called  a daughter  as  are  the  products  of  all  subsequent  decay  processes.  Once  produced 
by  decay  of  the  parent,  nuclide  1 can  randomly  decay  to  nuclide  2 with  a characteristic  time  constant 
Tj.  The  decay  chain  continues  with  nuclide  M decaying  to  nuclide  M-i-1  with  characteristic  time 
constant  r^. 

The  quantity  of  each  nuclide  present  as  a function  of  time  requires  solution  of  the  rate 
equations  for  the  system.  Rate  equations  are  sets  of  equations  that  balance  the  rate  at  which  a 
species  is  created  or  produced  against  the  rate  at  which  that  species  decays.  Adjacent  to  each  level 
in  Figure  7-6  we  have  given  the  appropriate  rate  equation.  We  assume  that  we  start  with  a known 
quantity  of  nuclide  0,  Nq  that  was  either  collected  from  a production  source  or  purified  from  ore,  or 
other  source.  Since  there  is  no  further  production,  the  rate  equation  contains  only  a decay  term.  The 
rate  of  decay  is  assumed  to  be  proportional  to  the  number  of  Ng  present  and  inversely  proportional 
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Figure  7-11.  A typical  radioactive  decay  chain  and  the  rate  equations  associated  with  each  nuclide. 


(7.61) 


(7.62) 


dN„ 

dt 


N N 

M M-\ 

^ M ^ M-\ 


(7.63) 


N _ ^ N-\ 

dt  ^Af-i 


(7.64) 


to  the  decay  constant  Tq.  The  rate  equation  for  the  first  daughter  and  subsequent  radioactive 
daughters  has  a production  term  (assumed  due  solely  to  decay  from  nuclide  0)  and  a decay  term. 
The  final  stable  daughter  has  a production  term  but  lacks  a decay  term.  In  general,  it  is  difficult  to 
solve  coupled  differential  equations  in  closed  form.  Regardless,  we  will  attempt  solutions  using 
Laplace  transforms. 

Consider  first  the  decay  of  the  parent  nuclide.  Consulting  Appendix  E on  Laplace  transforms 
we  find  that  Eq.  (7.61)  can  be  transformed  to  yield 

5Lko(.y)  - ^o(O)  = ! ^0  (7-65) 
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Solving  this  equation  for  ^q{s)  yields 


jV^[s)  - jV^[0)  ! s + 


Using  the  inverse  transforms  gives  the  final  result 


v„(0  = v„(o)< 


(7.67) 


The  parent  decays  with  simple  exponential  form  and  characteristic  decay  time  Tq.  The  time  constant 
ris  related  to  the  half-life  t,^2 » the  time  it  takes  for  half  of  the  nuclide  to  decay,  by  the  relation 


=(ln2)r  = 0.69315  r 


(7.68) 


Using  the  half-life  instead  of  t or  vice  versa  is  a common  mistake.  Forewarned  is  forearmed. 

The  rate  equation  for  the  first  daughter  nuclide  can  also  be  Laplace  transformed  to  yield 


SfA[[s)  - Ay[0) 


(7.69) 


Simple  combining  of  terms  yields 


(7.70) 


which  is  easily  solved 


Ul/1  rls.V 


(7.71) 


We  have  already  obtained  a solution  for,/fo('^)»  Eq.  (7.66),  which  is  substituted  into  Eq.  (7.71). 


,A(s) 


./ll(O) 


Ta\  S + 


(7.72) 


This  equation  can  be  solved  by  applying  the  appropriate  inverse  Laplace  transforms  to  yield 
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N,{t)  = N,{0)e-^'^'  + 


V^o 


^o(0)( 


(7.73) 


The  rate  equations  for  the  second  daughter  and  all  succeeding  daughters  (except  the  last  stable  one) 
follow  a similar  procedure.  The  rate  equation  for  the  M*  daughter  is  easily  seen  to  be  Laplace 
transformed  to  yield 


5*  + 


M 


M-l 


1/  ^ 
/ '^  M J 


(7.74) 


Earlier  results  are  repeatedly  substituted  into  the  expression 

1 


f 


s + 


M 


M-l 


s + 


V 


M J 


X 


s + 


M-l 


M-2 


S + 


M-l ) 


(7.75) 


until  all  (*^)  replaced  by  (0) 


The  end  result  has  the  form 


'^M-l  (^) 


y ) 

^ M 


Y-i;: 

M J 


V / M-l, 
^M-ii^ 


^M-l^M-2 


s+  y u + 

V /^M-2j\ 


\f  w A 
s + 

M-l  J\  / ‘'  M J 


(7.76) 


+ 


^o(O) 


^M-l^M-2  ■ ■ ■ ^0 


■y+  / 

V /^O, 


1/  Y ^ ^ ^ 

s+  V_  s + 

M-l  J\  / ^ M J 
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In  theory,  inverse  transformation  of  this  equation  will  yield  the  time  dependent  solution  for  NJ^t). 


The  specific  result  for  the  second  daughter  is  given  below 


^(0) 


k I ^1 
^ 2 


^ 1/  Y 1/ 

s+  y s+  y 

V /^lA  /Ay 


A)(o) 


T^Tq 


s+  y'  s+  y 

V /AA  / '^\j 


s + 


2j 


(7.77) 


with  inverse  form 


V,(f)  = vy0)e 


-t/T2 


+ 


V ^1  ^2  y 


w,(0)( 


g->". 


No(0) 


T^Tq 


1 1 


\r 


V ^0  ^1 y 


1 1 


V^i  Ay 


1 1 


V ^2  ^0  y 


(7.78) 


X 


P 


v^i  ^2  y 


P 


VA  ^oy 


^1 


v^o  ^1  y 


-tlz^ 


It  would  be  nice  to  be  able  to  continue  this  process  for  subsequent  daughters.  However,  a closed 
form  inverse  transform  to  Eq.  (7.76)  does  not  exist  for  M>3. 


The  analysis  can  be  carried  a little  further  if  we  make  an  approximation.  We  will  assume 
steady  state  behavior.  In  essence  we  will  assume  that  so  much  parent  exists  and  its  half-life  is  so 
long  that  it  will  essentially  never  disappear.  We  will  assume  that  all  other  half-lives  are  short 
compared  to  the  parent  half-life.  In  the  steady  state  all  time  derivatives  may  be  assumed  to  be  equal 
to  zero.  There  are  two  realistic  initial  conditions.  In  the  first,  the  radioactive  parent  has  been  freshly 
processed.  For  example,  it  has  been  freshly  mined,  isolated  chemically  from  other  elements,  and 
enriched  to  eliminate  other  nuclides.  In  this  case,  at  t=0,  only  the  parent  is  present.  That  is. 


(7.79) 


The  total  activity  is  given  by 
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Activity 


(7.80) 


Thus,  per  unit  mass  of  parent  nuclide,  the  activity  of  the  raw  ore  is  times  stronger  than  the  activity 
of  the  purified  parent  nuclide. 
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Problems 


7-1.  One  will  encounter  six  to  eight  fundamental  particles  in  discussing  ordinary  nuclear 
reactions.  List  these  particles  in  order  of  their  likelihood  of  being  encountered. 

7-2.  The  mass  energy  of  a Uranium-238  nucleus  is  238.050785  amu.  What  is  the  binding  energy 

(in  MeV)  per  nucleon  of  Uranium-238?  Compare  this  result  with  that  for  Uranium-235. 

7-3.  Estimate  the  energy  released  in  a single  fission  reaction  of  the  form 
on‘  -I-  92U^^^  ->  3 -I-  44Ru“*’  -I-  4gCd‘^^ 

This  energy  should  not  depend  strongly  on  exactly  which  fission  products  are  produced. 

7-4.  The  purpose  of  the  moderator  in  a nuclear  reactor  is  to  slow  down  the  fast  neutrons  produced 

by  fission.  Elastic  scattering  is  the  process  used  for  slowing  the  neutrons.  Which  is  likely 
to  make  the  best  moderator:  polyethylene,  aluminum,  or  lead?  That  is,  which  will  produce 
the  largest  energy  loss  per  elastic  scattering  event? 

7-5.  Which  reaction  is  most  likely  to  dominate  the  attenuation  of  gamma  rays  in  the  10  keV 
energy  region?  In  the  1 MeV  energy  region?  In  the  100  MeV  energy  region? 

7-6.  A hypothetical  material  has  a nuclear  reaction  cross  section  of  lO'^"^  cm^  and  a density  of  10^^ 
atoms/cm^.  How  thick  does  a foil  of  this  material  have  to  be  to  cause  at  least  10%  of  the 
particles  striking  it  to  undergo  the  nuclear  reaction? 

7-7.  The  mean  free  path  is  the  average  distance  a particle  can  travel  before  it  undergoes  a 
reaction.  The  mean  free  path  / is  given  by 

I = 1/Vcr 

where  N is  the  atomic  density  and  cr  is  the  cross  section  for  the  particle  interaction  with  the 
medium.  Calculate  the  mean  free  path  of  1 keV  neutrons  in  aluminum. 

7-8.  Calculate  the  mean  free  path  for  neutron-induced  fission  reactions  in  Uranium-235.  The 
energies  of  fission  neutrons 

7-9.  A Cobalt-60  radiography  source  has  an  activity  of  1000  Curies.  Each  disintegration 
produces  a 1.5  MeV  gamma  ray.  How  thick  must  a lead  shield  be  to  reduce  the  human  dose 
rate  to  1 mrem/hr  if  the  source  is  1 m from  the  radiographer’s  control  station? 
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7-10.  Using  the  Table  of  Isotopes  or  the  Chart  of  the  Nuclides  or  similar  reference,  describe  the 
radioactive  decay  chain  for  Uranium-238.  That  is,  list  the  decay  modes,  daughters,  and  half- 
lives  of  all  decays  until  a stable  isotope  daughter  is  reached.  Does  this  decay  chain  have  any 
weak  side  chains? 

7-11.  A nucleus  has  ten  more  neutrons  than  an  isotope  of  the  same  element  lying  near  the  valley 
of  stability.  What  radioactive  decay  mechanism  is  this  isotope  likely  to  employ? 

7-12.  An  element  heavier  than  lead  lies  near  the  valley  of  stability.  What  radioactive  decay 
mechanism  is  this  isotope  likely  to  employ? 

7-13.  A sample  of  uranium  ore  is  determined  to  contain  one  mole  of  Uranium-23 8 . If  no  chemical 
or  nuclear  processing  of  this  ore  has  occurred  in  the  last  billion  years,  what  weight  of 
Radium-226  (a  daughter  of  U-238  decay)  should  this  ore  sample  contain? 
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CHAPTER  8 


RADIOMETRY,  PHOTOMETRY,  & RADIOMETRIC  ANALYSIS 


Radiometry 

Radiometry  is  the  science  of  electromagnetic  radiation  measurement.  Our  interest  in 
radiometry  lies  in  two  areas.  First,  anyone  involved  with  sensor  design,  selection,  or  integration 
needs  to  be  familiar  with  the  terminology  used  in  radiometry.  These  terms  permeate  sensor  design 
discussions.  Second,  radiometric  analysis  is  a powerful  tool  that  often  lets  one  determine  the 
characteristic  range  equation  of  a sensor. 

The  nomenclature  of  radiometry  is  governed  by  international  standards  [l]-[3].  The 
fundamental  quantities,  their  symbols,  essential  definition,  and  units  are  summarized  in  Table  8-1. 
The  fundamental  quantity  is  energy  with  SI  units  of  Joules.  The  energy  density  is  the  energy 
contained  in  a unit  volume.  This  is  equivalent  to  the  derivative  (change)  of  total  energy  with  respect 
to  volume.  It  has  units  of  J/m^  Power  or  flux  is  the  time  rate  of  change  of  the  total  energy.  Flux 
has  units  of  Watts.  Flux  is  usually  measured  at  a surface.  Flux  density  is  the  flux  passing  through 
a unit  area.  This  is  equivalent  to  the  derivative  of  the  flux  with  respect  to  surface  area.  It  has  units 
of  W/m^.  Flux  density  either  comes  out  of  a surface  (it  is  emitted  by  the  surface)  in  which  case  it 
is  called  radiant  exitance  or  it  is  incident  on  a surface  in  which  case  it  is  called  irradiance.  Note 
that  radiant  exitance  and  irradiance  are  given  different  symbols  to  facilitate  discrimination  between 
the  direction  of  energy  flow  at  the  surface.  If  the  power  flows  through  a surface  at  an  angle,  the 


Table  8-1.  Standardized  radiometric  quantities  and  units. 


QUANTITY 

SYMBOL 

DEFINING  EQUATION 

SI  UNIT  NAMES 

UNIT  SYMBOLS 

ENERGY 

Qe 

... 

JOULE 

J 

ENERGY  DENSITY 

dQg  /dV 

... 

J/m^ 

FLUX  (POWER) 

*e 

dQg  /dt 

WVTT 

W IJ/s] 

FLUX  DENSITY  (AT  A SURFACE) 

- RADIANT  EXITANCE  (EMITTED) 

' IRRADIANCE  (INCIDENT) 

M, 

dtg/dA  coftd 

— 

W/m^ 

RADIANT  INTENSITY 

'e 

dtg/dO 

--- 

W/sr 

RADIANCE 

•-e 

dA  COS0 

... 

W/m^  -sr 
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actual  surface  area  must  be  corrected  by  the  cosine  of  the  angle  ^with  respeet  to  the  normal  to  the 
surface  to  yield  the  projeeted  surfaee  area.  This  accounts  for  the  eos^term  in  the  denominator  of 
the  defining  equation.  Radiant  intensity  is  the  flux  per  unit  solid  angle  (i.e.,  the  derivative  of  total 
flux  with  respeet  to  solid  angle).  It  has  units  of  W/sr.  Solid  angle  is  a produet  of  azimuthal  and 
elevation  angular  extent.  It  relates  the  direetionality  in  three-dimensional  spaee  of  flux  away  from 
a point  or  flux  towards  a point.  The  magnitude  of  a solid  angle  is  the  area  of  the  surface  encompass- 
ing the  flux  projeeted  on  a unit  sphere  (the  unit  sphere  has  a radius  = 1).  The  entire  unit  sphere  has 
an  area  of  4ti(1)^  = 4n  and  is  therefore  defined  to  have  4ti  steradians  (sr)  of  solid  angle.  Radiance 
is  a quantity  which  describes  the  derivative  of  total  energy  with  respect  to  time,  surfaee  area,  and 
solid  angle.  It  has  units  of  W/m^-sr. 

Table  8-1  uses  energy  as  the  basic  unit  for  defining  the  radiometrie  quantities.  There  is  an 
alternative  formulation  which  uses  the  total  number  of  eleetromagnetie  quanta  as  the  defining 
element.  In  this  formulation,  “Joules”  would  be  universally  replaced  by  “Number  of  Photons”. 
Otherwise  the  definitions  remain  the  same.  The  subscript  “e”  in  Table  8-1  explicitly  indicates  that 
the  quantities  use  the  energy-based  formulation.  Quantities  using  the  quantum-based  formulation 
customarily  have  a subseript  “q”  rather  than  “e”.  If  no  subseript  is  used,  then  the  energy-based 
formulation  should  be  assumed. 


A subseript  “T”  is  used  to  denote  a spectral  quantity,  i.e.,  a quantity  with  explieit  wavelength 
dependenee 


Quantity 

Unit  wavelength  interval  centered  at  X 


(8.1) 


The  spectral  and  spectrum-integrated  quantities  are  interrelated  by  the  expressions 


In  radiometric  analyses,  it  is  often  necessary  to  convert  from  one  radiometrie  quantity  to 
another  radiometric  quantity.  For  example,  we  may  have  a measurement  of  radiant  exitance  but 
desire  to  know  the  flux.  Figure  8-1  shows  the  interrelations  between  each  of  the  radiometrie 
quantities.  To  eonvert  from  one  to  another  follow  any  path  from  the  initial  quantity  to  the  final 
quantity  performing  the  indicated  operations  as  you  go.  For  example,  to  convert  from  radianee  to 
flux,  one  could  integrate  first  over  area  to  get  radiant  intensity  and  then  integrate  over  solid  angle 
to  get  flux.  Alternatively,  one  eould  integrate  over  solid  angle  to  get  radiant  exitance  and  then 
integrate  over  area  to  get  flux. 
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Figure  8-1.  Relationships  between  radiometric  quantities. 


So  far  the  discussion  has  addressed  only  quantities  associated  with  the  radiation.  Obviously, 
in  any  radiometric  analysis,  material  properties  must  be  considered.  There  are  four  material 
properties  relevant  to  radiometry:  absorptance,  reflectance,  transmittance,  and  emissivity.  The 
absorptance  a of  a material  is  the  ratio  of  radiant  energy  absorbed  by  the  material  to  the  radiant 
energy  incident  on  the  material 

(8.4) 

The  reflectance  /?  of  a material  is  the  ratio  of  radiant  energy  reflected  by  the  material  to  the  radiant 
energy  incident  on  the  material 

_ QrEFLECTED  /o 

p - — • (»-J) 

^INCIDENT 

The  transmittance  r of  a material  is  the  ratio  of  radiant  energy  transmitted  through  a material  to  the 
radiant  energy  incident  on  the  material 
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T = 


Q 


TRANSMITTED 


(8.6) 


Qincident 

The  fourth  quantity,  emissivity  is  slightly  different  in  general  nature.  The  emissivity  ^•of  a material 
is  the  ratio  of  the  energy  radiated  (emitted)  from  a material  to  the  energy  that  would  have  been 
emitted  had  the  material  been  a blackbody 

^ _ Qemitted 


a . 


(8.7) 


Qblackbody 

In  a later  section  of  this  chapter  we  will  discuss  blackbody  radiation  in  detail.  At  this  point  it  should 
suffice  to  say  that  blackbodies  have  unique  properties  that  make  them  suitable  as  absolute  (primary) 
standards  for  radiation  measurements.  By  definition,  blackbodies  have  s = 1. 


Emissivity  is  sometimes  called  emittance  in  analogy  with  absorptance,  reflectance,  and 
transmittance,  although  emissivity  remains  the  much  more  common  term.  In  the  same  light,  some 
individuals  will  use  the  terms  absorptivity,  reflectivity,  and  transmissivity,  instead  of  absorptance, 
reflectance,  and  transmittance.  Widespread  usage  of  these  three  alternative  forms  has  eliminated 
any  serious  connotations  of  error  in  their  use,  nevertheless,  emissivity,  absorptance,  reflectance, 
and  transmittance  are  the  officially  accepted  terms. 


In  Eq.  (8.7)  we  have  also  alluded  to  the  fact  that  emissivity  must  equal  absorptance.  This 
fact  is  required  in  order  to  establish  thermal  equilibrium  in  material  systems  in  contact  with  a 
radiation  field.  Since  reflection  and  transmission  do  not  add  or  subtract  energy  from  a material,  they 
cannot  change  its  temperature.  If  absorption  exceeded  emission,  then  a material  isolated  from 
everything  but  radiation  would  continue  to  absorb  energy  at  a faster  rate  than  it  could  emit  it.  The 
temperature  would  steadily  rise  towards  infinity.  If  emission  exceeded  absorption,  then  the  object 
would  radiate  energy  away  faster  than  it  could  absorb  it.  The  temperature  would  steadily  fall 
towards  absolute  zero.  Since  neither  extreme  is  physically  realistic,  it  follows  that  absorption  must 
at  some  point  be  equal  to  emission. 


The  relation 


(8.8) 


is  just  as  easy  to  justify.  Since  energy  must  be  conserved,  if  it  isn’t  reflected,  and  it  isn’t  absorbed, 
and  it’s  not  allowed  to  transmit  unaffected  through  the  material,  then  it  must  not  have  existed  in  the 
first  place  since  there  are  no  other  alternatives. 

Another  powerful  concept  in  radiometry  is  the  Lambertian  source  or  Lambertian  surface. 
A Lambertian  surface  has  the  characteristic  that  the  radiant  intensity  in  a direction  ^relative  to 

the  normal  to  the  surface  has  a smaller  value  by  a factor  of  than  the  radiant  intensity  normal 
to  the  surface  That  is, 

E = Ecos0  (8.9) 
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Eq.  (8.9)  is  known  as  Lambert’s  Law  for  diffuse  surfaces.  The  co^ ^dependence  holds  whether  the 
radiant  intensity  arises  from  emission  or  reflection.  If  due  to  reflection  it  is  unaffected  by  the 
direction  of  illumination.  The  total  radiant  flux  into  a hemisphere  around  a Lambertian  surface  is 
found  by  integration  to  be 

O = J r/Q dd  0,0^6  ^m.6  ^d(j)  1^- n (8.10) 

That  is,  the  flux  is  effective  radiated  into  ;rsteradians  (or  half  of  the  solid  angle  of  a hemisphere. 
Essentially,  Lambertian  surfaces  have  no  preferred  directivity  for  either  emission  or  reflection. 

The  geometry  associated  with  the  integration  above  is  shown  in  Ligure  8-2.  The  relation 
between  the  emitting  element  of  area  and  the  measurement  surface  element  are  clearly  shown. 


Figure  8-2.  Geometry  of  emission  from  a Lambertian  surface. 
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Another  interesting  property  of  Lambertian  surfaces  is  that  the  radiance  is  independent  of 
angle.  LFsing  our  earlier  definitions  we  find 


61  I„  I, cose  I„ 

dCldAcosO  dAcos0  dA  cos  9 dA  cos  9 dA 


L (8.11) 


Finally,  for  Lambertian  surfaces,  simple  relationships  exist  between  radiance,  radiant  intensity, 
radiant  flux,  and  radiant  exitance.  That  is. 


^ ^ M 

dA  71  dA  71 


(8.12) 


Because  of  this  assumption  of  a Lambertian  surface  will  always  simplify  radiometric  calculations. 

It  is  instructive  to  question  what  makes  a surface  Lambertian  and  are  any  real  surfaces 
Lambertian?  It  is  evident  from  the  reflection  characteristics  that  mirror-like  surfaces  are  not 
Lambertian.  Lambertian  surfaces  diffuse  any  light  that  they  reflect.  Matte  surfaces  are  more  likely 
to  be  Lambertian.  Physically,  if  the  surface  height  is  random  and  rough  on  the  scale  of  a wavelength 
of  the  radiation  of  interest,  and  has  a transverse  correlation  length  that  is  less  than  the  wavelength 
of  the  radiation,  the  surface  will  be  Lambertian.  Translated  into  common  language,  the  peak-to- 
valley  variation  in  height  must  be  larger  than  a wavelength,  height  measurements  taken  at  separated 
points  must  exhibit  random  variations,  and  you  cannot  move  more  than  one  wavelength  along  the 
surface  before  the  height  changes  from  a relative  peak  to  a relative  valley.  It  is  difficult  to  produce 
a surface  that  truly  possesses  this  character,  however,  most  surfaces  approximate  this  character  if 
the  wavelength  is  small  enough.  Although,  we  will  not  prove  it,  blackbody  radiators  are  Lambertian. 
They  exhibit  the  fundamental  behavior. 
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Photometry 


Photometry  is  simply  radiometry  complicated  by  the  fact  the  ultimate  detector  is  the  human 
eye.  Thus,  photometric  quantities  must  take  into  account  the  spectral  response  of  the  eye.  As  we 
shall  see,  this  is  a significant  complication.  Nevertheless,  the  combat  systems  engineer  must  be 
familiar  with  both  radiometric  measurements  as  well  as  photometric  measurements. 

For  every  radiometric  quantity,  there  is  a corresponding  photometric  quantity.  These  are 
summarized  in  Table  8-2.  It  is  a standard  convention  to  denote  the  photometric  quantities  by  using 
the  radiometric  symbols  with  the  subscript  “v”  instead  of  the  subscript  “e”  or  or  no  subscript  at 
all.  The  fundamental  photometric  quantity  is  the  luminous  energy.  It  is  related  to  the  radiant 
energy  by  the  relation 

^0.760/mi  , . 

0,  = [ dXK(X)Q^,  (8.13) 

J0.380/m  ^ ^ 

where  K(A)  is  the  spectral  luminous  efficacy  is  a factor  that  relates  radiant  energy  (in  J)  at  a specific 


Table  8-2.  Standardized  photometric  quantities  and  units. 


QUANTITY 

SYMBOL 

DEFINING  EQUATION 

UNIT  NAMES 

UNIT  SYMBOLS 

LUMINOUS  ENERGY 
(QUANTITY  OF  LIGHT) 

Q» 

0.760 

J K(X)  Q.xdX 

0.380  fim 

TALBOT 

[Im  «1 

LUMINOUS  ENERGY  DENSITY 

dQy  /dV 

--- 

3 

Im  e/m 

LUMINOUS  FLUX 

mm 

dQy  /dt 

LUMEN  (SI) 

Im 

LUMINOUS  FLUX  DENSITY 

- LUMINOUS  EXITANCE  (EMITTED) 

- ILLUMINANCE  (INCIDENT) 

d*y/dA 

LUX  (SI) 

PHOT 

FOOT  CANDLE 

lx  [Im/m^  1 
ph  Ilm/cm®  1 
ft  c Ilm/ft®  I 

LUMINOUS  INTENSITY 

* V 

d*y/dO 

CANDELA  (SI) 

cd  [Im/srI 

LUMINANCE 

(PHOTOMETRIC  BRIGHTNESS) 

•-V 

d**y/dO  dA  co®0 

VALID  FOR  LAMBERTIAN  1 
SOURCES  ONLY  J 
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wavelength  to  the  luminous  energy  (in  Talbots  or  lumen-seconds).  In  essence  the  luminous  efficacy 
is  visual  response  of  the  eye.  The  conversion  from  radiant  energy  to  luminous  energy  only  needs 
to  consider  energy  at  wavelengths  in  the  visible  spectrum.  Hence,  the  limits  of  integration.  The 
luminous  efficacy  is  plotted  in  Figure  8-3.  There  are  two  curves:  one  for  scotopic  vision  (the  black 
& white  nighttime  vision  produced  by  the  rod  cells  in  the  human  retina)  and  one  for  photopic  vision 
(the  color  daytime  vision  produced  by  the  cone  cells). 


Figure  8-3.  Spectral  luminous  efficacy  curves  for  the  two  form  of  human  visual  response. 


WAVELENGTH  (nm) 


Several  of  the  photometric  quantities  have  units  that  are  valid  SI  units.  Most  do  not.  Almost 
all  of  the  quantities  have  alternate  units  that  are  not  SI.  In  general  these  alternate  units  are  obtained 
by  substituting  feet  or  centimeters  for  the  SI  meters.  Some  of  the  alternate  units  of  luminance  have 
an  additional  factor  of  \!k  that  accounts  for  Lambertian  emission  into  k steradians.  Two  additional 
quantities  are  listed  in  Table  8-2  that  have  no  radiometric  counterparts.  The  spectral  luminous 
efficiency  (as  opposed  to  efficacy)  is  the  relative  response  of  the  eye.  The  photopic  and  scotopic 
spectral  luminous  efficiency  curves  are  presented  in  Figure  8-4.  The  final  photometric  quantity  is 
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Figure  8-4.  Spectral  luminous  efficiency  of  human  vision. 


350  400  450  500  550  600  650  700  750  800 


WAVELENGTH  (nm) 


retinal  illuminance.  This  quantity  is  the  luminance  multiplied  by  the  area  of  the  pupil  (of  the  eye). 
It  is  a measure  of  how  much  luminous  intensity  actually  illuminates  the  retina.  It  is  commonly 
encountered  in  studies  of  external  degradation  of  the  human  visual  system  such  as  flashblindness. 
All  of  the  photometric  quantities  with  radiometric  counterparts  are  used  in  vision  system  analyses 
exactly  like  their  radiometric  counterparts  are  used  in  other  analyses.  Photometric  quantities  can 
be  transformed  between  themselves  exactly  like  the  corresponding  radiometric  quantities  using  the 
relations  in  Figure  8-1. 
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Radiometric  Analysis  - Range  Equations 

Radiometric  analysis  is  the  science  of  accounting  for  the  flows  of  energy  involved  in  the 
operation  of  any  sensor  system  using  radiation  (electromagnetic,  acoustic,  or  nuclear)  as  a 
propagator.  In  some  instances,  radiant  energy  may  flow  from  the  sensor  itself  or  some  other  source, 
interact  with  the  target,  and  flow  back  to  the  sensor.  In  others  the  target  may  produce  the  energy  that 
ultimately  flows  to  the  sensor.  When  coupled  with  an  expression  for  the  intrinsic  noise  of  the 
sensor,  the  radiometric  analysis  result  produces  the  carrier-to-noise-ratio  equation  for  the  sensor. 
This  is  the  single  most  important  equation  involved  in  design. 

Many  factors  affect  the  flow  of  radiation.  Electronic  components  may  produce  gain  or  loss 
of  energy  (power).  Losses  may  come  reflection  or  absorption  of  energy.  Either  mechanism  has  an 
adverse  effect  on  performance,  but  reflection  may  produce  unwanted  side  effects.  Propagation 
through  the  medium  intervening  between  sensor  and  target  can  cause  attenuation  through  scattering 
or  absorption.  Imperfect  collimation  and/or  diffraction  may  cause  the  intensity  to  decrease  through 
spreading.  Turbulence  or  multipath  effects  will  cause  some  of  the  energy  to  go  in  unwanted 
directions.  Interaction  of  external  radiation  with  a target  will  not  reflect  isotropically.  Antennas 
used  for  collected  signal  radiation  will  have  angle-dependent  collection  efficiency.  Consideration 
of  all  of  these  effects  and  factors  is  the  province  of  radiometric  analysis. 

The  basic  approach  is  to  begin  at  the  point  most  distant  from  the  sensor  output  in  terms  of 
propagation.  Note:  in  an  active  sensor,  this  most  distant  point  may  in  fact  be  spatially  close  to  the 
output  point.  The  flow  of  radiation  is  then  traced  element  by  element  accounting  for  all  gains  and 
losses,  reflections,  redirections,  and  spreading.  The  power  remaining  at  the  output  point  is  the 
average  signal  power.  Dividing  the  expression  representing  this  result  by  the  expression  for  the 
sensor  noise  power  gives  the  CNR  equation. 

Figure  8-5  illustrates  the  radiometric  analysis  that  yields  the  CNR  equation  for  a conven- 
tional microwave  radar.  This  result  is  usually  called  the  radar  range  equation.  [4]  The  analysis 
begins  at  the  transmitter  unit  with  output  power  Pj.  The  waveguide  and  other  transmit  components 
introduce  a loss  factor  (defined  in  this  instance  as  the  ratio  of  output  power  to  input  power).  Thus 
the  antenna  is  seen  to  radiate  an  effective  power  P.  If  this  power  were  to  be  radiated  by  an 
omnidirectional  antenna  (one  that  radiates  its  power  into  4ti  sr),  the  power  density  (intensity)  at 
distance  R is  the  effective  power  divided  by  the  factor  47tR^.  The  real  antenna  is  assumed  to  have 
an  antenna  gain  defined  by 

Gj=A7tA^!X^  (8.14) 

The  antenna  gain  is  the  ratio  of  the  solid  angle  of  a sphere  divided  by  the  effective  solid  angle  into 
which  the  antenna  radiates.  Thus,  the  effective  power  density  of  the  radar  at  range  R is  increased 
by  the  antenna  gain.  As  the  power  density  propagates  to  the  target,  which  is  assumed  to  be  at  range 

R,  it  suffers  atmospheric  extinction  of  magnitude  . The  target  presents  an  effective  area,  called 
the  radar  cross  section  <t,  to  the  incident  radiation.  The  total  reflected  power  is  the  product  of  the 
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Figure  8-5.  Radiometric  derivation  of  the  radar  equation. 
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incident  power  density  and  this  effective  area.  The  power  is  assumed  to  be  reflected  into  4n  sr.  Any 
directivity  in  the  reflection  is  already  accounted  for  in  the  definition  of  the  radar  cross  section.  As 
the  reflected  radiation  spreads  out  from  the  target  and  propagates  back  to  the  receiver,  it  suffers 

another  1/4 ttR^  spreading  loss  and  another  atmospheric  attenuation  loss.  The  resulting  power 
density  is  then  incident  on  the  receive  antenna.  The  product  of  the  power  density  and  the  area  of  the 
receive  antenna  is  the  total  received  power.  This  power  is  reduced  by  any  losses  in  the  receiver 
electronics  At  this  point  we  now  have  an  expression  for  the  overall  signal  power.  Dividing  by 
the  known  receiver  noise  kTBF  (where  T is  the  electronics  temperature,  B is  the  noise  bandwidth  of 
the  electronics,  and  F is  an  excess  noise  term  that  must  be  included  to  describe  real  amplifiers), 
yields  the  radar  range  equation 


P 

CNR  = 

kTBF 


kTBF 


(8.15) 


Radiometric  analysis  of  any  other  sensor  proceeds  in  the  same  straightforward  manner  as  our  radar 
example. 

In  summary,  radiometric  analysis  is  one  of  the  powerful  and  universal  tools  available  to  the 
combat  systems  engineer.  It  lets  him  derive  a CNR-based  range  equation  for  any  radiation-based 
sensor. 
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Blackbody  Radiation 

It  is  not  difficult  to  make  a blackbody.  The  engineer  merely  needs  to  make  something  that 
will  absorb  all  (or  at  least  the  vast  majority  of)  radiation  incident  upon  it.  A large  hollow  ball  with 
a small  hole  is  a good  example.  Any  radiation  passing  through  the  hole  into  the  ball  will  strike  the 
interior  of  the  ball  and  reflect  many  times  before  it  can  escape  out  the  small  hole.  Since  no  material 
is  a perfect  reflector,  even  a few  percent  absorption  multiplied  by  dozens  of  reflections  lead  to 
almost  complete  absorption.  Indeed,  thirty  years  ago,  teachers  used  to  make  excellent  blackbodies 
by  stacking  a hundred  (or  so)  double-edged  razor  blades  side  by  side.  The  very  shallow  taper  of  the 
blades  from  body  to  sharp  edge  meant  the  any  radiation  striking  the  face  with  all  the  sharp  edges 
would  undergo  many  reflections  off  the  shiny  metal  (from  one  blade  to  the  adjacent  blade  and  back 
again.  Despite  the  high  reflectivity  of  the  metal,  the  edged  side  of  the  stack  appeared  to  be  a deep, 
velvety  black.  Heating  the  blade  stack  would  provide  a nice  blackbody  radiator  for  infrared 
experiments.  Unfortunately,  it  is  almost  impossible  to  find  double-edged  razor  blades  today. 
Technology  has  made  them  obsolete  for  their  original  intended  purpose.  Scientists  are  interested 
in  blackbodies  because  a = s implies  that  a perfect  absorber  (a  “black”  body)  is  also  a perfect 
emitter.  The  radiant  exitance  from  a blackbody  is  a function  only  of  the  temperature  and  the 
wavelength  of  the  radiation.  The  material  characteristics  are  completely  eliminated.  The  spectrum 
and  “brightness”  of  one  blackbody  at  temperature  T is  exactly  the  same  as  the  spectrum  and 
brightness  of  any  other  blackbody  at  temperature  T. 

For  these  reasons,  not  only  are  blackbodies  used  as  emission  standards  in  radiometric 
measurements,  they  have  also  been  the  subject  of  considerable  study  in  and  of  themselves.  Indeed, 
studies  of  blackbody  radiation  paved  the  way  for  quantum  theory.  Max  Planck  was  forced  to 
propose  that  radiation  existed  in  quantized  form  (Einstein  later  called  them  photons)  in  order  to 
derive  an  expression  for  the  emission  of  a blackbody  that  matched  experimental  measurements. 
That  expression,  described  by  various  people  as  Planck’s  Radiation  Law,  the  Planck  distribution, 
or  the  Planck  function,  takes  slightly  different  forms  depending  on  the  exact  radiometric  quantity 
described.  The  author  has  found  the  expression  for  the  spectral  radiant  exitance  to  be  the  most 
useful.  In  energy-based  form  it  is  given  by  [5] 
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where  has  units  of  W/m^-pm  when  the  temperature  T is  expressed  in  Kelvins  and  the  wave- 
length A,  is  expressed  in  pm.  The  quantum-based  expression  for  the  spectral  radiant  exitance  is 
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where  has  units  of  photons/sec-m^-pm.  The  primary  difference  between  the  two  forms  is  the 
factor  hc/X,  the  photon  energy. 
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The  behavior  of  the  Planck  distribution  is  shown  in  Figure  8-6.  At  any  specific  material 
temperature,  the  radiant  exitance  rises  rapidly  with  increasing  wavelength  to  a peak  and  then  falls 
off  much  more  slowly  as  wavelength  continues  to  increase.  All  points  on  a curve  for  one  value  of 
temperature  always  lie  below  the  corresponding  points  on  any  curve  with  temperature  greater  than 

Tq.  All  points  on  that  first  curve  always  lie  above  the  corresponding  points  on  any  curve  with 
temperature  lower  than  T^.  The  wavelengths  corresponding  to  the  peak  value  of  radiant 

exitance  are  related  to  the  temperature  T of  the  surface  that  produced  them  by  Wien’s  Displacement 
Law 

A = 2^91.156  jum-K  (8.18) 

Thus,  a surface  with  a temperature  of  1000  K has  its  peak  value  of  radiant  exitance  at  a wavelength 
of  2.897756  pm.  A Planck  distribution  with  its  peak  value  at  1 .0  pm  is  produced  by  a temperature 
of  2897.756  K. 


Figure  8-6.  The  Planck  distribution  of  radiant  exitance  for  blackbodies. 
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There  are  times  when  knowledge  of  the  total  radiant  exitance  M is  desired.  This  can  be 
obtained  by  integrating  Eq.  (8.16).  The  result  is  commonly  referred  to  as  the  Stefan-Boltzmann 
Law, 
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where  Mhas  units  of  W/m^,  if  T and  2 are  expressed  as  above.  The  constant  cris  called  the  Stefan- 
Boltzmann  constant.  In  quantum-based  form  the  Stefan-Boltzmann  Law  takes  the  form 
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where  has  units  of  photons/sec-m^. 

In  some  infrared  sensor  analyses  it  is  desirable  to  determine  how  much  the  radiant  exitance 
of  a surface  will  change  if  the  temperature  of  that  surface  is  varied.  This  can  be  determined  by 
taking  the  temperature  derivative  of  Eq.  (8.16).  The  evaluated  form  of  this  derivative  is  given  as 
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where  / dT  has  units  of  W/m^-pm-K.  The  quantum  based  version  takes  the  form 
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where  ! dT  has  units  of  photons/m^-pm-K.  The  product  of  the  temperature  derivative  and 
the  temperature  change  gives  an  estimate  of  the  change  in  spectral  radiant  exitance. 
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Reflectance 


Along  with  atmospheric  attenuation  and  diffraction,  reflection  from  the  target  is  among  the 
critical  factors  that  must  be  considered  in  radiometric  analyses.  As  defined  earlier,  the  reflectance 
yO  of  a material  is  the  ratio  of  the  total  radiant  energy  reflected  by  the  material  to  the  total  radiant 
energy  incident  on  the  material.  This  importance  is  only  partially  due  to  the  potential  for  loss  of 
energy  that  is  posed  by  non-unity  reflectance.  It  is  also  due  to  the  angular  dependence  of  the 
reflected  radiation. 

There  are  two  ideal  reflectors.  The  first  is  the  Lambertian  surface  discussed  in  an  earlier 
section.  The  Lambertian  surface  reflects  energy  with  the  same  angular  dependence  regardless  of  the 
angle  of  incidence  of  the  radiation.  As  shown  in  Figure  8-7,  the  reflected  energy  always  obeys  a 
cosine  of  the  reflection  angle  (relative  to  the  surface  normal).  Lambertian  reflectors  are  often  called 
diffuse  reflecting  surfaces.  The  descriptive  terms  matte  finish  or  flat  (as  in  flat  black)  may  also  be 
used.  When  illuminated  with  coherent  radiation,  Lambertian  surfaces  exhibit  the  phenomenon  of 
laser  speckle  (the  optical  analog  of  Rayleigh  cross  section  fluctuations  in  radar  - both  phenomena 
will  be  discussed  in  later  volumes  of  this  work. 


Figure  8-7.  Directionality  of  reflectance  from  a Lambertian  surface. 
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The  second  ideal  reflector  is  the  perfectly  smooth  surface.  In  the  perfectly  smooth  surface, 
the  reflecting  surface  has  no  scratches,  dents,  bumps,  dust  motes,  chemical  deposits,  composition 
fluctuations,  or  any  other  roughness  on  any  scale  that  approaches  within  several  orders  of  magnitude 
of  the  wavelength  of  electromagnetic  radiation  being  reflected.  In  practical  terms  this  means  that 
surface  height  variations  cannot  exceed  more  than  a few  atomic  diameters  or  noticeable  deviation 
from  ideal  behavior  will  occur.  The  surface  may  be  curved  but  it  cannot  have  the  slightest 
roughness.  The  angular  dependence  of  the  radiation  from  the  perfectly  smooth  surface  is  determined 
solely  by  diffraction.  The  reflecting  surface  will  act  like  an  aperture,  whose  size  and  shape  are  those 
of  the  original  surface  projected  onto  a plane  perpendicular  to  the  initial  propagation  direction.  The 
resulting  diffraction  pattern  will  then  be  directed  into  the  angle  of  the  ordinary  reflection.  If  the 
reflecting  surface  is  curved  the  diffraction  pattern  will  be  further  altered  by  the  geometric  reflection 
characteristics  of  the  surface. 

The  perfectly  smooth  surface  exhibits  mirrorlike  reflections.  Such  surfaces  (and  any 
reflections  from  them)  are  called  specular  surfaces  or  specular  reflections.  When  such  surfaces  are 
oriented  at  just  the  right  angle  with  respect  to  a distant  bright  source,  observers  will  see  a bright 
reflection.  Since  few  situations  have  stationary  reflectors  and  truly  motionless  observer,  the  bright 
reflection  appears  as  a short  flash  of  light  often  called  a glint.  Consequently,  some  people  call  the 
surfaces  glint  reflectors.  Another  descriptive  term  sometimes  used  is  the  term  gloss  finish. 

Few  real  surfaces  have  characteristics  that  match  either  ideal  reflector.  They  usually  possess 
some  mixture  of  both.  The  angular  characteristics  are  broad  and  smooth  without  obvious  diffraction 
characteristics,  but  there  is  a strong  peak  at  normal  incidence.  That  is,  the  surface  exhibits  some 
reflection  at  all  angles,  but  there  is  a noticeable  glint  at  the  mirror  reflection  angle.  Dull  surfaces 
have  a larger  percentage  of  diffuse  (Lambertian)  character;  shiny  surfaces  have  a larger  percentage 
of  specular  (smooth  surface)  character. 


Figure  8-8.  Directionality  of  reflectance  from  a smooth  surface. 
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The  bidirectional  reflectance  distribution  function  (BRDF)  p{6^,  6^,  describes  the  complex 

angular  dependence  of  reflections  from  materials  with  less  than  ideal  behavior.  The  BRDF  is 
defined  as 

(8.23) 

Power  reflected  in  a unit  solid  angle  about  direction 
Power  incident  from  direction 

where  the  subscript  i denotes  incidence  and  the  subscript  r denotes  reflection.  The  BRDF  has  units 
of  “per  steradian”.  As  a result,  BRDF  is  not  limited  to  a maximum  value  of  1 . A specular  reflector 
may  have  a BRDF  in  the  specular  reflection  direction  of  10®  per  steradian  or  higher.  Bidirectional 
reflectance  distribution  functions  are  not  easy  to  measure,  requiring  very  sophisticated  equipment. 
However,  a limited  number  of  measurements  on  important  materials  can  be  found  in  the  literature. 
[6] 


Most  materials  have  no  preferred  directions  (other  than  the  surface  normal).  For  these 
isotropic  materials  there  is  no  specific  dependence  on  There  will  be  a dependence  on  except 
in  the  case  of  the  Lambertian  reflector.  A few  materials  do  have  preferred  directions.  For  example, 
fabrics  have  threads  that  run  parallel  or  perpendicular  to  each  other.  Machined  surfaces  have 
parallel  machining  marks.  Diffraction  gratings  have  parallel  rows  of  microscopic  reflecting 
surfaces.  All  of  these  materials  will  have  explicit  ^ dependences  that  must  be  considered.  This 
possibility  is  usually  only  considered  in  analysis  if  there  is  evidence  that  such  anisotropic  materials 
are  present. 

If  the  radiation  is  reflected  through  180  degrees  directly  back  along  the  direction  of 
incidence,  we  obtain  the  monostatic  reflectance  distribution  function  (MRDF) 

P(^i ^ ^ (f>i ) (8-24) 

This  is  the  reflectance  distribution  that  directly  affects  the  performance  of  laser  sensors.  Figure  8-9 
shows  the  monostatic  reflectance  distribution  function  for  two  real  materials:  flame-sprayed 
aluminum  (made  from  aluminum  powder  which  is  melted  by  a flame  or  plasma  and  blown  onto  an 
aluminum  surface  where  it  solidifies  retaining  much  of  its  powder-like  character  while  still  being 
cemented  onto  the  base  plate)  and  light  brown  gloss  enamel  paint  on  smooth  metal.  These  materials 
are  isotropic  so  we  need  only  plot  the  ^dependence.  Isotropic  materials  obey  the  relation 

pM(0,4.)  = P«(0i)-  (8.25) 

Also  shown  is  the  MRDF  of  a Lambertian  surface.  As  is  evident,  the  flame-sprayed  aluminum  is 
almost  Lambertian,  but  a very  small  amount  of  specular  character  remains.  The  painted  surface 
exhibits  a high  degree  of  specular  character. 
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Figure  8-9.  Monostatic  directional  reflectance  distribution  function. 


The  total  reflectance  p (the  reflectance  associated  with  the  transmittance  and  absorptance) 
can  be  determined  by  integrating  either  the  BRDF  or  MRDF  over  all  possible  reflection  angles,  i.e., 

p-j^  dOsmO  p{0,4>).  (8.26) 

We  will  discuss  only  the  MRDF  from  now  on,  however,  any  of  the  results  can  be  generalized  to  use 
the  BRDF  by  substitution.  If  we  assume  that  the  Lambertian  reflector  has  an  MRDF  of  the  form 

pDiFFusE(d,(f)  = ^^^^  sr-'  (8.27) 

n 

where  is  the  total  reflectance.  Performing  the  integration  over  both  angles  yields 
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(8.28) 


p-\  d<b\  dOsmO  — -n—^- 

Jo  Jo  ^ ^ 

which  verifies  that  Pf^  is  the  total  reflectance. 

If  we  consider  a small  flat  reflecting  surface,  then  it  is  obvious  that  reflection  can  only  occur 
into  a hemisphere  (at  most).  Reflection  cannot  occur  into  the  material.  Because  the  total  reflectance 
contains  only  components  that  have  been  reflected  into  a hemisphere,  the  total  reflectance  of  opaque 
materials  is  sometimes  called  the  hemispherical  reflectance. 

Most  materials  have  MRDF  functions  with  both  diffuse  and  specular  characteristics.  In  an 
approximation  useful  for  some  materials,  the  MRDF  may  be  separated  into  a Lambertian  component 
and  a specular  component 


P SPECULAR^P P DIFFUSE^P • (8.29) 

The  specular  component  could,  in  principle,  be  approximated  by  any  convenient  function.  The  log- 
normal function  provides  a qualitatively  accurate  fit  in  many  instances,  but  does  not  lend  itself  to 
easy  evaluation  of  integrals.  The  author  is  unaware  of  a good  functional  approximation  for  Pspecular- 
It  is  possible  to  state  that  the  product  of  the  peak  value  of  the  MRDF,  /XOjO),  multiplied  by  the 
effective  solid  angle  Qj/2  (nominally  measured  at  the  half-intensity  point)  should  be  less  than  unity, 
i.e.. 


/?(0,0)Qi^2  ~ ~ Ps^^  ’ (8-30) 

where  0y2  is  half-angle  of  the  cone  with  solid  angle  equal  to  the  solid  angle  of  the  reflection.  The 
specular  component  of  the  total  reflectance  is  given  by  p^.  The  more  the  specular  component 
approaches  the  ideal  smooth  surface  result,  the  narrower  the  value  of  Q1/2,  and  the  larger  the 
permissible  value  ofyC>(0,0). 

A cosine-theta  curve  fit  to  the  MRDF  for  large  angles  (well  away  from  the  specular  peak) 
will  yield  the  diffuse  component 


in  /\  Pn  COS ^ 

. (8-31) 
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The  diffuse  component  of  the  total  reflectance  is  given  by  the  coefficient  of  the  curve  fit  p^. 
Obviously,  the  total  reflectance  is  approximately  equal  to  the  sum  of  the  diffuse  and  specular 
components  and  must  be  less  than  unity 

Pd^  Ps~  P^^-  (8-32) 
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All  of  the  preceding  discussions  have  addressed  materials  with  no  transmittance.  Radiation 
is  either  reflected  or  absorbed.  However,  the  same  concepts  can  be  applied  to  transparent  or  semi- 
transparent materials.  The  reflected  component  should  now  be  assumed  to  be  all  radiation  that  is 
neither  absorbed  nor  transmitted  without  directional  change.  Thus,  scattering  is  considered  to  be 
an  element  of  reflection,  as  is  Fresnel  reflection.  However,  in  a transparent  material,  radiation  may 
be  scattered  into  a sphere,  rather  than  only  a hemisphere.  The  MRDF  (and  BRDF)  for  transparent 
materials  is  defined  the  same  as  for  opaque  materials,  but  the  expression  relating  MRDF  to  total 
reflectance  must  involve  the  integral  over  a sphere  (rather  than  a hemisphere) 

p-j^  dPsinP  . (8.33) 

The  total  reflectance  is  obviously  not  related  to  the  hemispherical  reflectance  in  transparent 
materials. 

The  reflectances  described  above  are  functions  of  wavelength.  Two  wavelength  effects 
dominate.  First,  the  total  reflectance  will  vary  with  wavelength  because  the  absorptance  varies  with 
wavelength.  Because  transmittance,  absorptance,  and  reflectance  must  sum  to  unity,  any  change  in 
one  quantity  is  likely  to  change  both  of  the  others.  Second,  even  if  the  total  reflectance  does  not 
change  appreciably  with  wavelength  over  a wide  span  of  wavelengths,  the  MRDF  may  change 
significantly.  Consider  flame-sprayed  aluminum.  This  material  is  a metal  with  a total  reflectance 
near  unity.  Most  metals  reflect  strongly  from  visible  through  radio  wavelengths.  However,  the 
surface  roughness  of  flame-sprayed  aluminum  has  a spatial  scale  of  the  order  of  0. 1 mm.  At  infrared 
wavelengths,  the  surface  is  very  rough  and  has  a diffuse  (almost  Lambertian)  MRDF.  However,  at 
microwave  wavelengths,  the  surface  is  smooth  (compared  to  the  wavelength)  and  the  MRDF  will 
have  a strong  specular  character.  The  total  reflectance  is  near  unity  at  both  extremes,  but  the 
reflection  properties  are  vastly  different.  As  wavelength  is  scanned  from  the  infrared  to  the 
microwave,  the  MRDF  will  transition  from  diffuse  to  specular  character.  At  submillimeter 
wavelengths,  the  MRDF  will  have  elements  of  both  characters. 
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Problems 


8- 1 . A blackbody  has  a temperature  of  3 K.  Plot  the  radiant  exitance  of  this  blackbody  over  the 
wavelength  range  from  100  pm  to  10  cm.  Ten  to  fifteen  points  is  sufficient.  What  is  the 
wavelength  of  peak  emission?  What  is  the  radiant  exitance  at  this  wavelength  (the  peak)? 
Referenced  to  a standard  blackbody  at  300  K,  what  is  the  temperature  increase  in  the 
standard  blackbody  that  provides  an  increase  in  radiant  exitance  of  that  standard  blackbody 
that  is  equal  to  the  total  radiant  exitance  of  the  3 K blackbody?  This  temperature  increase 
is  the  signal-equivalent  temperature  difference. 

8-2.  A material  has  a reflectance  of  90%.  If  that  material  has  a temperature  of  300  K what  is  the 
radiant  exitance  from  the  surface? 

8-3.  A thin  plate  is  perfectly  absorbing  on  one  side  and  perfectly  reflecting  on  the  other.  The 
plate  is  in  a vacuum  and  perfectly  insulated  from  its  environment  except  for  radiation. 
Radiation  from  a 300  K blackbody  is  incident  only  on  the  absorbing  side  of  the  plate.  What 
is  the  equilibrium  temperature  that  the  plate  will  reach?  What  temperature  would  the  plate 
reach  if  both  sides  were  perfectly  absorbing  and  it  is  still  illuminated  from  only  one  side? 

8-4.  A perfectly  collimated  laser  beam  is  incident  on  a perfectly  reflecting  Lambertian  surface 
at  45°  angle  of  incidence.  What  is  the  angular  distribution  of  the  reflected  radiant  intensity? 

8-5.  Two  blackbodies  have  variable  temperatures  that  are  exactly  1 K different.  What  is  the 
temperature  dependence  of  the  difference  between  the  radiant  exitances  of  the  two  plates? 

8-6.  A laser  emits  25  W of  radiation  at  5 14.5  nm.  How  many  lumens  does  this  laser  emit?  The 
laser  radiation  is  shaped  such  that  it  uniformly  illuminates  a diffuse  reflecting  surface  (100 
cm^  area)  without  loss  of  power.  What  is  the  luminous  exitance  (lux)  coming  from  this 
surface? 

8-7.  A dedicated  microwave  communications  link  is  established  between  two  widely  separated 
antennas.  A transmitter  is  attached  to  one  antenna  while  a receiver  is  attached  to  the  other 
antenna.  Using  the  nomenclature  used  in  the  radar  equation  example,  determine  an 
expression  for  the  CNR  of  such  a communications  link  as  a function  of  antenna  separation. 

8-8.  A specular  reflector  has  a total  reflectance  of  1.  If  the  peak  value  of  the  monostatic 
reflectance  is  10®  sr  ',  what  the  half-angle  of  the  cone  into  which  most  of  the  radiation  is 
reflected? 

8-9.  A mirror  has  a diffuse  monostatic  reflectivity  component  PuiffusA^  = 6.366  x 10'^  cos^sr  '. 

What  is  the  monostatic  reflectance  of  the  mirror  if  it  is  tilted  at  60°?  What  is  the  maximum 
possible  value  of  the  specular  component  of  the  total  reflectance? 
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8-10.  A surface  is  covered  with  random  bumps.  The  average  bump  height  is  0.1  pm  and  the 
average  spacing  between  adjacent  bumps  is  0.1  pm.  What  is  the  probable  dominant 
character  of  the  reflectance  at  a wavelength  of  10  pm? 
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CHAPTER  9 


SENSOR  FUNCTIONS 


Uses  of  Sensor  Systems 

There  are  few  operations  that  are  performed  by  modem  eombat  platforms  that  do  not  involve 
sensors  in  one  way  or  another.  Sensors  are  used  to  navigate  from  the  beginning  to  the  end  of  a 
mission.  They  are  used  to  detect,  localize,  track,  and  potentially  identify  possible  targets.  They  are 
used  to  guide  and  control  the  weapons  that  are  used  to  attack  those  targets.  They  are  used  to  detect, 
localize,  track,  and  possibly  identify  potential  threats  to  the  platform  and  to  guide  and  control  the 
weapons  that  are  used  to  defend  against  those  threats.  They  are  used  to  determine  the  status  of 
systems  on  board  the  platform.  They  are  used  to  determine  the  status  of  the  environment  around  the 
platform.  There  are  so  many  functions  that  it  is  not  possible  to  address  them  all  in  any  specific 
fashion. 

Figure  9-1  lists  a number  of  the  functions  that  combat  sensor  systems  are  commonly  called 
upon  to  perform.  Most  of  the  listed  functions  are  self-evident.  However,  a few  of  the  functions  may 
not  be  familiar  to  everyone.  For  example,  pseudo-imaging  is  the  acquisition  of  high-resolution  but 
under-sampled  or  partially  sampled  data  on  a scene  (usually  containing  a target).  For  example,  a 
rosette  scan  will  sample  a scene  only  where  the  scan  pattern  covers  it.  The  sampled  data  could  be 
used  to  generate  an  image  of  sorts,  however,  it  is  seldom  used  to  generate  an  observable  image.  It 
is  more  often  used  to  track  a target  or  possibly  identify  a target.  Range  profiling  is  the  acquisition 
of  high-resolution  range  data  on  a target  without  any  accompanying  angular  resolution  data.  It  has 
limited  use  in  target  identification.  The  meanings  of  terms  like  target  recognition,  target  identifica- 
tion, target  orientation  estimation,  etc.,  are  discussed  in  great  detail  in  Chapter  12.  Situational 
awareness  is  the  determination  of  the  location  and  apparent  motion  of  all  platforms  (whether 
friendly,  hostile,  or  neutral;  and  air,  surface,  or  subsurface)  in  a spherical  volume  out  to  some  range 
surrounding  one’s  own  platform.  It  is  useful  in  permitting  a commander  to  determine  how  a 
complex  situation  is  evolving.  Terrain  following  is  straight  line  flight  along  a vertical  path  that 
attempts  to  maintain  a constant  altitude  above  the  ground.  Terrain  avoidance  is  flight  along  a three 
dimensional  path  that  attempts  to  maintain  a constant  altitude  above  the  ground  but  takes  advantage 
of  the  ability  to  minimize  altitude  variations  (with  respect  to  sea  level)  by  going  around  taller  terrain 
features.  Obstacle  avoidance  is  the  ability  to  detect  vertical  and  horizontal  obstacles  (such  as  radio 
towers  and  or  power  transmission  lines  and  maneuver  over,  under,  or  around  them  while  maintaining 
normal  terrain  following  or  terrain  avoidance  flight.  Bathymetry  is  the  measurement/mapping  of 
ocean  bottom  depth  or  the  depth  of  targets  beneath  the  ocean  surface.  Vibratometry  is  the 
measurement  of  the  vibrations  of  an  object.  It  can  be  useful  in  determining  the  status  of  bearings 
on  rotating  equipment.  It  can  also  be  used  to  identify  tactical  targets  by  their  unique  vibrational 
characteristics. 
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Table  9-1.  Common  sensor  functions. 


SEARCH 

TARGET  DETECTION 

TARGET  LOCATION 

- RANGE 

- AZIMUTH 

- ELEVATION 

VELOCITY/RANGE  RATE  DETERMINATION 

TRACKING 

- ANGLE  ONLY 

- RANGE  & ANGLE 

- RANGE,  ANGLE,  & VELOCITY 

lAAAGING 

PSEUDO-IAAAGING 

RANGE  PROFILING 

OBJECT  RECOGNITION 

OBJECT  IDENTIFICATION 

OBJECT  ORIENTATION  ESTIAAATION 

AAAPPING 

RECONNAISSANCE 
SURVEILLANCE 
SITUATIONAL  AWARENESS 
NAVIGATION 

- TERRAIN  CONTOUR  AAAPPING  (TERCOM) 

- FEATURE-BASED 

- DOPPLER 

- INERTIAL 

- CELESTIAL 


COAAMUNICATIONS  RECEPTION 
AIR  SPEED  MEASUREMENT 
GROUND  SPEED  MEASUREMENT 
ALTIMETRY 

TERRAIN  FOLLOWING 
TERRAIN  AVOIDANCE 
OBSTACLE  AVOIDANCE 

HOMING  GUIDANCE 

- ACTIVE 

- SEMI-ACTIVE 

- PASSIVE 

AIMPOINT  SELECTION 
MODULATION/DEMODULATION 
TARGET  DESIGNATION 

MUNITION  FUZING 

- PROXIMITY 

- IMPACT 

-ALTITUDE  (BAROMETRIC) 
-ALTITUDE  (RADAR) 

DAAAAGE/KILL  ASSESSMENT 

ENVIRONMENTAL  REMOTE  SENSING 

- ANEMOMETRY  (WIND  SPEED) 

- TURBULENCE  DETECTION 

- THERMOMETRY 

- ATMOSPHERIC  CONSTITUTION 

- CONTAMINATION  DETECTION 

- SOIL/ROCK/VEGETATION  TYPE 

BATHYMETRY 


SYSTEM  STATUS  VIBRATOMETRY 

- CONSUMABLES  (FUEL  LEVEL,  OXYGEN,  etc.) 

- PROPULSION  (ENGINE  RPM,  TEMP.,  etc.) 

- ELECTRICAL  (VOLTAGE,  AMPERAGE,  etc.) 

- HYDRAULIC  (PRESSURE) 

- DAAAAGE  CONTROL  (TEMPS,  WATER  LEVELS,  etc.) 
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The  Six  “Elemental”  Sensor  Functions 


The  preceding  section  has  presented  a fairly  comprehensive  catalog  of  functions  that  modem 
combat  systems  are  performed.  As  stated  earlier,  the  list  is  too  long  to  discuss  each  function  in 
detail.  However,  after  performing  a detailed  analysis  of  exactly  what  is  meant  by  most  of  the 
functions  listed  in  Table  9-1  the  author  has  come  to  believe  that  virtually  all  of  them  can  be 
decomposed  into  combinations  of  six  “elemental”  sensor  functions:  search,  detection,  estimation, 
modulation/  demodulation,  image-based  perception,  and  tracking.  That  is,  by  performing  several 
of  the  elemental  functions  in  the  proper  sequence,  with  the  proper  signal  conditioning  between 
functions,  the  most  complex  function  can  be  synthesized. 

Search  is  the  collection  of  activities  involved  in  providing  sensor  coverage  of  areas  of 
interest.  This  function  concerns  itself  with  providing  the  opportunity  for  sensors  to  perform  other 
subsequent  functions  such  as  target  detection  or  tracking.  It  is  a separate  function  from  those 
subsequent  functions.  It  is  possible  to  detect  an  object  without  search  although  this  requires 
consideration  in  the  sensor  design.  It  is  also  possible  to  successfully  and  efficiently  search  an  area 
without  accomplishing  any  detections  (or  indeed  without  any  possibility  of  successful  subsequent 
detection).  The  search  function  involves  coordination  of  sensor  scanning  and  platform  motions  to 
cover  a designated  area  in  minimum  time  and  with  maximum  efficiency  of  coverage  and  sensor 
utilization.  Chapter  1 0 elaborates  on  the  nature  of  the  search  function  and  provides  useful  results 
that  can  impact  sensor  system  design  and  performance. 

Detection  is  the  simple  act  of  determining  whether  an  object  of  potential  interest  is  present 
or  absent  in  the  sensing  volume  of  a system.  It  says  nothing  about  the  object  other  than  it  is  present. 
It  does  not  indicate  whether  it  is  a hostile  target,  a friendly  platform,  or  even  that  it  is  a military 
platform.  Detection  simply  says  whether  or  not  something  other  than  “noise”  is  present  in  the  sensor 
signal.  The  ability  with  which  a sensor  can  detect  a target  will  depend  on  the  carrier-to-noise  ratio 
achieved  in  the  sensor,  the  false  alarm  rate  that  is  acceptable,  and  the  inherent  characteristics  of  the 
targets.  Chapter  1 1 discusses  the  elemental  function  of  detection  in  detail. 

Estimation  is  the  process  of  determining  the  value  of  some  property  of  a “target”.  Range, 
velocity,  altitude,  azimuth,  elevation  angle,  acceleration,  signal  “size”  are  examples  of  properties 
that  are  “estimated”  by  processing  the  signals  from  appropriate  sensors.  The  precision  with  which 
the  value  of  a parameter  may  be  estimated  will  be  shown  to  be  limited  by  the  resolution  of  the 
measuring  sensor  and  the  signal-to-noise  ratio  achieved.  Chapter  12  discusses  the  elemental 
function  of  estimation  in  detail. 

Modulation  and  demodulation  are  seldom  end  functions  of  sensors  but  one  or  the  other  or 
both  are  often  critical  intermediates  in  the  accomplishment  of  desired  end  functions  such  as 
parameter  estimation  or  tracking.  In  some  instances  the  sensor  will  perform  both  modulation  and 
demodulation.  For  example,  a radar  may  transmit  a frequency-modulated  waveform  and  receive  a 
modified  frequency-modulated  waveform  in  the  pursuit  of  estimating  the  range  and  velocity  to  a 
target.  In  others,  one  sensor  will  establish  a modulation  for  another  sensor  to  utilize,  such  as  a coded 
laser  designator  signal  for  a laser  guided  weapon.  In  others  the  interaction  of  a sensor  signal  with 
the  target  may  produce  a modulation  that  is  useful  for  another  function,  as  in  a conical  scan  tracking 
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system.  Many  different  kinds  of  modulation  are  utilized.  These  include  amplitude  modulation, 
frequency  modulation,  phase  modulation,  polarization  modulation,  and  various  pulse  modulations. 
Chapter  13  discusses  modulation  and  demodulation  and  its  relation  to  sensor  systems. 

Imaging,  or  more  properly,  image-based  perception,  is  an  elemental  function  that  is  more 
complex  than  detection  or  estimation.  Imaging  itself  is  merely  a way  in  which  sensor  data  is 
collected  and  organized.  However,  when  an  image  is  presented  to  a human  observer  to  process,  the 
perceptual  processes  that  ensue  are  critical  functions  in  many  weapons  systems.  For  example,  in 
an  electro-optical  targeting  system,  a “television”  sensor  gathers  a near  infrared  image  of  a scene. 
The  image  is  presented  to  the  weapons  operator  who  examines  it  and  may  or  may  not  detect  and/or 
identify  a target,  based  only  on  his  examination  of  the  image.  The  human  perceptual  system  has 
been  characterized  both  empirically  and  semi-empirically  in  terms  of  describable  target  and  image 
characteristics.  Chapter  14  discusses  imaging  and  image-based  perception  in  considerable  detail. 

Tracking  is  the  process  by  which  the  motions  of  detected  objects  are  followed  over  time  and 
predicted  into  the  future.  This  elemental  function  is  also  somewhat  more  complex  than  simple 
detection  or  estimation.  Tracking  can  be  achieved  by  dedicated  sensor  and  processor  hardware.  It 
can  also  performed  entirely  as  a processing  function  if  an  adequate  detection  sensor  is  present.  The 
former  dedicated  systems  are  basically  servo  systems.  Target  motion  produces  an  error  signal  which 
the  servo  acts  upon  to  reduce.  As  the  error  signal  is  kept  within  bounds,  the  position  of  the  target 
is  kept  within  known  limits.  Tracking  systems  which  do  nothing  but  process  data  from  detection 
systems  rely  on  tracking  filters.  This  filters  average  measurements  and  predict  motions.  Subsequent 
measurements  are  compared  against  predictions  and  the  prediction  is  continuously  corrected. 
Chapter  15  describes  both  types  of  tracking.  Principles  and  implementation  of  dedicated  tracking 
systems  are  described.  The  theory  of  tracking  filters  is  presented  and  the  performance  of  different 
filters  is  compared. 

The  author  is  not  completely  convinced  that  this  list  of  elemental  sensor  functions  is 
complete,  yet  so  far  has  not  identified  others  that  seemingly  fulfill  all  of  his  criteria.  He  may  change 
his  mind  in  the  future.  In  establishing  this  list,  he  restricted  it  to  functions  that  have  direct  utility, 
that  is,  the  output  of  the  elemental  function  is  of  immediate  interest  to  the  user  or  the  function  itself 
is  mission  critical.  He  also  restricted  the  list  to  those  functions  for  which  considerable  theoretical 
analysis  is  required  to  determine  their  impact  on  sensor  performance.  He  tried  to  eliminate  functions 
that  are  implemented  in  single  components  and  are  simply  considered  tools  of  the  sensor  designer. 
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Incidental  Sensor  Functions 


In  light  of  the  preceding  section  where  six  elemental  sensor  functions  were  identified,  a 
number  of  incidental  functions  were  also  identified.  These  are  functions  that  sensor  designers 
incorporate  to  improve  performance,  permit  use  of  specific  technologies,  or  perform  necessary  signal 
processing  functions.  They  are  not  elemental  in  that  there  are  usually  alternative  sensor  implementa- 
tions that  can  avoid  their  incorporation,  yet  can  perform  the  same  top  level  functions  for  the  user. 
For  example,  analog-to-digital  conversion  may  simplify  the  processing  of  sensor  signals,  yet,  analog 
processing  can  almost  always  yield  comparable  results  (most  sensor  technologies  predate  the 
invention  of  the  digital  computer).  In  Table  9-2  we  list  some  of  the  more  significant  incidental 
sensor  functions. 


Figure  9-2.  A Partial  List  of  Incidental  Sensor  Functions. 


TARGET  ILLUMINATION 

DISPLAY 

TIMEKEEPING 

AMPLIFICATION 

FILTERING 

HETERODYNE  MIXING 
ANALOG-TO-DIGITAL  CONVERSION 
DIGITAL-TO-ANALOG  CONVERSION 
SIGNAL  MULTIPLEXING/DEMULTIPLEXING 
FAST  FOURIER  TRANSFORM  (FFT) 

SIGNAL  INTEGRATION 
SENSOR  POINTING 
DETECTOR  COOLING 
INERTIAL  LINE-OF-SIGHT  STABILIZATION 
ELECTRICAL  POWER  CONDITIONING 
ENVIRONMENTAL  CONDITIONING 
DATABASE  MANAGEMENT 
DIGITAL  DATA  PROCESSING 
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Problems 


9- 1 . Identify  a sensor  function  that  is  not  included  in  Table  9- 1 . 

9-2.  Pick  a complex  sensor  function  from  Table  9-1  and  identify  how  the  elemental  functions 
combine  and  interact  to  create  that  complex  function.  Identify  any  critical  subfunctions  that 
are  not  covered  by  the  six  elemental  functions. 

9-3.  Nominate  a candidate  for  a seventh  elemental  sensor  function.  Justify  your  nomination. 
This  function  need  not  be  listed  in  Table  9- 1 . Does  your  nomination  suggest  a need  for  more 
than  seven  elemental  functions?  That  is,  is  there  some  related  function  identified  that  is 
neither  part  of  the  six  original  functions  nor  your  nominated  seventh  function? 
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CHAPTER  10. 


SEARCH 


Introduction  to  Search 

Search  is  the  collection  of  activities  involved  in  providing  sensor  coverage  of  areas  of 
interest.  This  function  concerns  itself  with  providing  the  opportunity  for  sensors  to  perform  other 
subsequent  functions  such  as  target  detection  or  tracking.  Search  involves  the  coordination  of  sensor 
scanning  and  platform  motions  to  cover  a designated  area  in  minimum  time  and  with  maximum 
efficiency  of  coverage  and  sensor  utilization.  Search  and  detection  are  separate  functions  - it  is 
possible  to  search  with  sensors  that  are  malfunctioning  or  are  otherwise  incapable  of  adequate 
detection  performance.  Nevertheless,  search  and  detection  are  interdependent.  Without  effective 
search,  the  detection  abilities  of  even  the  best  sensors  are  underutilized.  Conversely,  without 
adequate  detection  ability,  even  an  optimum  search  strategy  may  produce  unsatisfactory  results. 

Depending  on  the  mission  being  performed  and  the  conditions  under  which  it  is  performed, 
search  may  be  carefully  planned  and  executed,  it  may  be  simple,  regular,  and  repetitive,  or  it  may 
be  random.  It  may  involve  continuous  observation  during  a continuous  sweep  or  it  may  involve 
discrete  looks  at  discrete  positions.  It  may  be  carried  out  by  stationary  platforms  or  it  may  require 
platform  motion  to  accomplish.  The  detection  range  and  coverage  of  the  sensors  utilized  will  affect 
the  choice  of  search  type  and  the  speed  at  which  it  can  be  accomplished.  The  interrelation  of  sensor 
and  search  type  are  the  subject  of  this  Chapter. 

The  search  problems  commonly  encountered  in  combat  systems  applications  can  usually  be 
sorted  into  one  of  two  general  types.  One  search  type  involves  the  one-time  search  of  a large  area 
using  a sensor  of  limited  range  operated  from  a mobile  platform.  We  will  refer  to  problems  of  this 
type  as  area  search  problems.  Typical  area  search  applications  include  anti-submarine  patrol  by 
long-range  aircraft,  target  acquisition  by  antiship  missiles,  mine-hunting,  and  “SCUD  hunting”  by 
attack  aircraft.  The  shape  of  the  search  area,  the  range  of  the  detection  sensor,  the  mobility  and 
degree  of  reaction  expected  from  the  target  being  searched  for,  and  the  time  available  for  the  search 
will  all  affect  the  nature  and  potential  for  success  of  area  searches.  A second  type  of  search  involves 
repeated  search  of  a designated  solid  angle  out  to  some  required  range  by  a scanning  sensor  operated 
from  a platform  that  may  be  assumed  to  be  stationary.  This  type  of  problem  will  be  called  volume 
search.  Air  defense  search,  ballistic  missile  early  warning,  and  target  acquisition  by  anti-aircraft 
missile  seekers  are  typical  examples  of  volume  search.  The  solid  angle  to  be  searched,  the  required 
detection  range,  the  revisit  rate  required,  and  the  angular  resolution  of  the  search  sensor,  among 
other  things,  will  all  impact  volume  search  performance.  In  subsequent  sections  we  will  address  the 
peculiarities  of  area  search  and  volume  search  in  more  detail. 
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Regardless  of  search  type,  some  search  systems  will  utilize  a continuous  scan  of  a suitable 
sensor  system  while  others  will  use  measurements  obtained  at  discrete  positions,  the  discrete 
positions  being  scanned  over  the  search  area.  Discrete  measurements  complicate  the  search  process 
in  ways  that  are  described  in  the  following  section. 

Finally,  the  ultimate  goal  of  search  is  target  detection.  Detection  is  addressed  in  Chapter  1 1 . 
The  effects  of  non-unity  detection  probability  on  the  choice  of  search  patterns  and  of  search 
parameters  are  discussed  in  the  last  section  of  this  chapter. 
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Circle  Packing 

Discrete  measurement  search  involves  the  one-at-a-time  observation  of  limited  portions  of 
the  search  region.  It  is  very  common  that  each  observation  covers  a region  that  is  roughly  circular. 
For  example,  if  the  measurement  is  made  by  a single  pulse  from  a radar,  the  beam  of  the  radar  is 
likely  to  be  circular.  If  the  “measuremenf  ’ is  an  azimuthal  scan  by  a radar  or  a conical  scan  by  an 
infrared  sensor,  the  area  covered  by  the  scan  will  be  circular.  Using  this  circular  assumption 
(extension  of  any  circular  region  result  to  an  elliptical  region  result  is  trivial)  we  observe  that  it  is 
impossible  to  “tile”  any  search  region  with  circular  tiles  without  either  leaving  gaps  (reduced  degree 
of  coverage)  or  having  significant  overlaps  (reduced  efficiency  of  coverage).  Figure  10-1  shows  two 
tiling  patterns  (hexagonal  and  square)  drawn  with  no  overlap  between  the  circular  tiles.  Both 
patterns  show  that  there  is  considerable  uncovered  area.  To  reduce  the  uncovered  area  to  zero  will 
require  overlap  between  tiles  and  a reduction  in  the  efficiency  of  coverage.  The  relationship 
between  completeness  of  coverage  and  the  efficiency  of  coverage  is  analyzed  in  the  circle  packing 
problem.  It  should  be  noted  that  the  two  tiling  patterns  studied  here  (hexagonal  packing  and  square 
packing)  are  the  only  two  patterns  of  packing  equally  sized  circles  observed  in  nature.  Hexagons, 
squares,  and  triangles  are  the  only  regular  polygons  that  can  be  tiled  without  overlap  and  without 
gaps  (and  triangles  can  only  be  tiled  if  the  orientation  of  the  triangles  is  alternated. 


Figure  10-1.  Two  regular  tiling  patterns  using  circular  “tiles”. 
The  patterns  as  drawn  have  zero  tile  overlap. 


HEXAGONAL  TILING 


SQUARE  TILING 
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Consider  first  the  case  where  the  circular  coverages  are  sparsely  (yet  still  regularly)  tiled. 
That  is,  the  separation  between  the  centers  exceed  the  radii  of  the  circles.  Let  a denote  the  radius  of 
a circular  coverage  area  and  let  2d  denote  the  separation  between  nearest  neighbor  circle  centers. 
In  this  case  it  is  relatively  easy  to  calculate  that  the  fraction  of  the  total  area  covered  by  circles  is 
given  by 


^HEX  ~ ^CIRCLE  ^ ^HEXAGON  ~ I 2-\fi  d — ^7T  / 2^/3^ (10-1) 
for  hexagonal  packing  and  by 

dsQ  ~ ^CIRCLE  ^ ^SQUARE  ~ / 4)^  (10.2) 

for  square  packing.  We  have  defined  a dimensionless  separation  parameter  d/a.  The  coverage 
is  relatively  poor  (roughly  90%  for  hexagonal  packing  and  79%  for  square  packing)  even  if  ^ = i. 

Improved  coverage  requires  overlap.  Figure  10-2  shows  the  geometry  of  coverage  and 
overlap  in  the  two  forms  of  packing.  Figure  10-3  shows  the  geometry  of  hexagonal  packing  in  a 
higher  level  of  detail.  Determination  of  the  area  that  is  not  covered  (the  small  triangular  shaped 
region  at  the  center  of  the  figure)  is  not  amenable  to  a simple  mathematical  expression.  It  requires 
calculation  of  the  area  of  more  regular  geometric  objects  and  adding  and  subtracting  the  areas  of 
those  objects  until  the  residual  corresponds  to  the  area  of  interest. 


Figure  10-2.  Geometry  used  in  calculating  overlap  and  coverage  gaps  in  a)  hexagonal  packing 
(or  tiling)  and  b)  square  packing.  Overlap  regions  are  shown  by  +45°  diagonal  shading. 
Uncovered  regions  are  shown  by  -45°  diagonal  shading. 

a)  b) 
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Figure  10-3.  Geometry  of  overlap  and  non-coverage  calculations  in  hexagonal  packing 
showing  the  different  geometric  regions  used  in  the  calculation. 


The  area  of  the  small  triangular  region  (-45°  striped)  that  is  not  covered  by  a circle  can  be 
calculated  using  the  following  general  expression 

Area  Not  Covered  = Area  of  Large  Triangular  Region 

- 3 X Area  of  a 60°  Sector 

+ 3/2  X Area  of  The  Lens-Shaped  Overlap  Region 
~ ^LTR  ~^^60S  ^-^^LSOR  (10.3) 

which  can  be  further  decomposed  into 

Area  Not  Covered  = Area  of  Large  Triangular  Region 

- 3 X Area  of  a 60°  Sector 

+ 3/2  x2  X (Area  of  Cross-Hatched  Sector 
- Area  of  Checked  Triangle) 

~ ^LTR  ■ ^^60S  "^cr)  • (10.4) 

The  area  of  the  large  equilateral  triangle  is  easily  seen  to  be 

Altr  = (10.5) 
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as  is  the  area  of  the  three  60°  sectors 


3 - 0.5:/ra^ . (10.6) 

The  area  of  the  cross-hatched  sector  is  given  by 

Af^fjg  =Q56a^  = cos~^{d ! a).  (10.7) 

The  area  of  the  checked  triangle  is  given  by 

A(.j.  = 0.5cd  = d-^a^  -d^  . (10.8) 

The  area  not  covered  by  the  circles  can  now  be  calculated  by 

Aj^c  - +3{a^  cos~^[d  / a)-d-\la^  - j . (10.9) 


For  each  circular  spot  added  to  the  hexagonal  pattern,  two  small  triangular  areas  with  no  overlap  are 
produced.  Each  added  circular  spot  covers  an  effective  hexagonal  area 

= 2.59808jc^  = 2-j3d^  (10.10) 

where  x is  the  side  of  the  hexagon.  Thus  the  effective  fractional  coverage  in  hexagonal  packing  is 

I - 0.57Ta^  + 3{a^  cos”' {d ! a)- d sla^  -d^  j j 

’1hEX=1  ^2 

or 

0.57Ta^  cos”'(t/ / a)-d^a^  -d^\ 

n„Ex=  -^2  ■ 

If  we  define  a dimensionless  ratio  ^ = d/a,  then  the  fractional  coverage  becomes 

Vhex  = (l  / 2 V3);r^”"  - cos”'  (^)  - ^1  - ) (10.13) 

Eq.  (10.13)  is  essentially  the  same  as  Eq.  (10.1)  with  a correction  accounting  for  overlap. 

The  efficiency  in  covering  the  area  is  given  by 
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^ HEX  ~ 1 


cos  ^{d  / a)-d^a^  j 


(10.14) 


£„„=l-(3/;r)(cos-'|-|Vl^) 


(10.15) 


Following  the  general  approach  taken  above,  but  without  showing  the  details,  the  square 
packing  problem  yields  the  following  results.  In  the  square  coverage  pattern,  the  diamond  shaped 
area  not  covered  has  area 


— 4t/  7EQ.  +4 


a 


cos  *(<i / a)-d-\la^  -d‘ 


The  effective  fractional  coverage  is  given  by 


4t/  — 7VQ  + 4 


^SQ 


= 1 


fl^cos  ^{d  / a)-d-\Ja^  -d" 


or 


Tta^  -4 


^sQ  - 


a 


4d^ 

cos“' [d  / a)- d -d" 


4d^ 


or 


dsQ 


= (n: / 4)r^  - fr^  cos-‘ (|) - T' 


The  efficiency  in  covering  the  area  is  given  by 


^SQ  - 1 ' 


fl^cos  \d  / a)-d-\Ja^  -d" 


Tta 


or 


(10.16) 


(10.17) 


(10.18) 

(10.19) 


(10.20) 


(10.21) 


The  results  for  the  fractional  coverage  and  the  coverage  efficiency  are  summarized  in 
graphical  form  in  Figure  10-4.  Only  the  critical  region  between  zero  overlap  (<^  =1)  to  complete 
overlap  are  plotted.  It  is  obvious  that  hexagonal  packing  is  superior  to  square  packing.  Consider- 
ably less  overlap  is  required  to  achieve  complete  coverage.  The  worst  case  efficiency  is  only  about 
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92%.  Square  packing  requires  considerably  more  overlap  to  achieve  complete  coverage.  In  square 
packing,  the  worst  case  efficiency  can  be  as  low  as  64%.  However,  hexagonal  packing  is 
considerably  harder  to  accomplish  in  practice  than  is  square  packing.  That  is  why  both  types  of 
packing  have  analyzed  here. 


Figure  10-4.  Fractional  coverage  {rf)  and  coverage  efficiency  (s)  for 
hexagonal  and  square  circle  packing  as  function  of  the  overlap  parameter 
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Area  Search 


Let  us  consider  searching  an  irregular  area  with  a search  sensor  with  finite  area  coverage. 
The  configuration  envisioned  is  similar  to  that  shown  in  Figure  10-5.  The  search  process  is  desired 
to  regular,  that  is,  it  can  be  accomplished  without  overly  complex  and  frequently  changing  motions. 
Such  a search  will  exhibit  three  kinds  of  inefficiencies.  The  regular  motion  will  cause  some  small 
areas  to  be  missed,  causing  gaps  in  the  search  coverage.  It  will  also  cause  some  search  effort  to  be 
expended  outside  the  search  area,  an  inefficiency  this  author  calls  oversearch.  The  regular  motion 
may  also  cause  the  finite-sized  instantaneous  search  area  to  turn  back  upon  itself  This  invariably 
happens  in  back  and  forth  search  motions.  The  small  area  which  is  scanned  more  slowly  or  scanned 
more  than  once  can  be  called  re-search. 

All  three  processes  are  undesirable.  Gaps  in  coverage  can  lead  to  reduced  detection 
probability.  Oversearch  and  re-search  do  not  contribute  to  detection  (ostensibly  any  targets  are 
contained  within  the  designated  search  area)  but  cause  extra  time  to  be  consumed.  This  delays 
detection  which  gives  the  target  more  time  to  react  by  attempting  to  escape,  by  attacking  the 
searcher,  or  by  accomplishing  more  of  its  original  mission. 


Figure  10-5.  Inefficiencies  in  area  search. 
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In  ideal  exhaustive  search  it  is  assumed  that  the  area  to  be  searched  is  completely  covered 
once  (with  no  gaps  in  coverage),  without  wasted  effort  searching  outside  the  designated  area  (over- 
search) and  without  searching  any  area  twice  (re-search).  Let  ^j.  be  the  actual  area  of  the  region  to 
be  searched.  Let  the  search  be  accomplished  by  a sensor  system  capable  of  search  a swath  of  width 
W carried  by  a platform  moving  at  velocity  V.  An  ideal  exhaustive  search  can  be  completed  in  a 
time 


T^Aj./WV.  (10.22) 

The  fraction  of  the  total  area  that  can  be  searched  in  time  t is  given  by 

A(t)  = WVt  (10.23) 

or 

A{t)  I A^  = WVt  I A^  = t IT . (10.24) 

In  a time  T the  entire  area  can  be  searched.  We  will  assume  a “cookie  cutter”  detection  process. 
That  is  if  the  target  is  within  an  area  that  has  been  exhaustively  searched,  it  will  have  been  detected 
with  unit  probability.  If  it  is  outside  the  area  that  has  been  searched,  it  will  not  be  detected.  That 
is,  detection  occurs  when  the  swath  of  the  search  sensor  passes  over  the  location  of  the  target.  As 
a function  of  elapsed  time  we  have 

P(t)^WVt ! Aj^yt  (10.25) 

where  P{i)  is  the  cumulative  detection  probability.  If  a target  is  located  at  a random  position  within 
the  search  area,  then  the  probability  density  of  detecting  the  target  is  a uniform  function  of  time  with 
a range  from  0 to  T.  The  expected  value  of  the  time  required  for  target  detection  is  T/2. 

Let  us  consider  a random  search  of  the  area  rather  than  an  exhaustive  search.  A random 
search  covers  the  search  area  at  a fixed  rate  of  area  coverage  per  unit  time,  but  rather  than  visiting 
each  “segment”  of  area  one  at  a time  in  order  until  the  area  is  covered,  the  “segments”  are  selected 
and  searched  at  random.  Each  segment  has  the  same  probability  of  being  selected  next  as  any  other 
segment,  including  those  already  searched.  Thus,  there  is  a small  but  finite  probability  that  some 
segment  will  never  be  searched. 

Assume  the  target  has  not  yet  been  detected.  Because  the  target  is  as  likely  to  be  in  the  next 
small  volume  to  be  searched  as  it  is  in  any  other  small  volume,  the  probability  density  that  the  target 
will  be  detected  in  a unit  interval  of  time  t t+At  is  given  by 

p{t)^t  = y^t . (10.26) 

Let  the  quantity  q{i)  represent  the  probability  that  the  target  has  not  been  detected  by  time  t.  Then 
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q(t)=\-P(t). 


(10.27) 


The  probability  that  the  target  is  still  not  detected  by  the  time  t+At  later  is  given  by  the  probability 
that  it  has  not  been  detected  at  time  t multiplied  by  the  probability  that  the  target  is  not  detected  in 
the  interval  t t+At.  That  is, 


q{t  + A?)  = q{t)\\  - A?] 


Rearranging  terms  we  have 


q{t  + A?)-^(?) 
At 


-rq{t) 


(10.28) 


(10.29) 


Since  At  is  arbitrary,  we  may  take  the  limit  as  it  approaches  zero.  In  this  case  Eq.(  10.29)  reduces 
to  a differential  equation 


^=-M,)  or  !^  = dlnq{,)  = -rdt. 


(10.30) 


The  latter  form  of  the  equation  is  easily  recognized  as  the  logarithmic  derivative  with  solution 

q[t)-e  (10.31) 

The  cumulative  probability  of  detection  as  a function  of  time  then  becomes 

P{t)  = \-e~^^  (10.32) 

The  cumulative  probability  for  random  search  (Eq.(  10.32))  is  compared  with  that  of  exhaustive 
search  (Eq.(10.25))  in  Figure  10-6.  Clearly  exhaustive  search  is  superior.  Note:  We  have  assumed 
that  y is  constant  in  the  derivation  above.  If  for  some  reason  y is  a function  of  time,  i.e.,  y=  y{t), 
then  the  exponent  {-n{f)  = -yt)  in  Eq.(10.32)  can  be  replaced  by  the  integral  of  y{t)  over  time 

- n{t)  = -yt  dt  y{t)  (10.33) 

with  no  loss  in  validity. 

The  question  arises  as  to  why  bother  to  derive  the  random  search  expression?  Before 
answering  this,  let  us  ask  another  question.  What  if  the  target  (implicitly  assumed  to  be  stationary 
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Figure  10-6.  Comparison  of  detection  probability  versus  time  for  exhaustive  and  random 

search. 


0.0  0.5  1.0  1.5  2.0  2.5  3.0 

COVERAGE  FACTOR 


in  the  analysis  above)  is  free  to  move  about  within  the  search  area  and  attempt  to  evade  the  searcher? 
Empirically,  it  has  been  found  that  evading  targets  will  turn  an  exhaustive  search  problem  into  a 
random  search  problem.  This  does  not  mean  that  the  searcher  should  use  a random  search  pattern. 
The  searcher  should  almost  certainly  pursue  one  of  the  exhaustive  search  patterns,  but  that  searcher 
will  need  to  search  the  area  repeatedly.  The  probability  of  detection  as  a function  of  time  will  more 
likely  follow  Eq.(  10.32)  rather  than  Eq.(  10.25). 

A special  case  of  area  search  results  when  a target  is  initially  known  to  be  at  a precise 
location,  but  the  “searching”  platform  is  located  a considerable  distance  away.  At  the  time  of  initial 
detection,  the  target  leaves  that  position  at  a substantial  speed  in  a random  direction.  The  search 
sensor  must  now  try  to  find  the  target  in  an  area  that  is  constantly  expanding  in  time.  A prime 
example  of  this  can  be  found  in  undersea  warfare.  When  a submarine  attacks  a ship,  the  radio  signal 
(SOS)  from  the  ship  results  in  the  submarine’s  position  being  very  precisely  known  at  the  time  of 
attack.  Anti-submarine  warfare  (ASW)  patrol  aircraft  may  be  hundreds  of  km  distant.  They  reach 
the  position  of  the  attack  many  minutes  if  not  hours  after  the  submarine  has  left  the  area.  The 
aircraft  does  not  know  which  direction  the  submarine  has  taken.  It  must  then  attempt  an  exhaustive 
search  of  an  ever-expanding  area.  Let  the  target  escape  velocity  be  U and  the  time  delay  before 
beginning  the  search  be  r.  At  time  t the  area  containing  all  possible  target  positions  is  given  by 
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A^(t)^;rU^t  + Tf  . 


(10.34) 


If  the  swath  width  and  speed  of  the  search  platform  are  the  same  as  defined  earlier,  then  the 
time  dependent  probability  of  detection 


p{t)  = y{t)  = VW / = 


VW 

7lU\t  + Tf  ' 


(10.35) 


If  we  evaluate  the  probability  exponent 


n{t)  = ]ldtY{t)  = \'dt 


VW 


7rU\t  + T) 


VW  I 1 

< 

ttU^  \ t t + T 


(10.36) 


we  then  have 


p(0  = i 


VW  \ \ 1 

f+r 


(10.37) 


If  we  let  t — > 00,  then  we  find 


VW 


P(oo)  = 1 


e <1. 


(10.38) 


That  is,  the  detection  probability  does  not  go  to  unity  no  matter  how  long  the  platform  searches. 
This  is  reasonable,  because  although  the  area  searched  grows  linearly  with  time,  the  area  that  needs 
to  be  searched  grows  as  the  square  of  the  elapsed  time.  The  rate  of  probability  growth  is  highest 
immediately  after  time  r and  falls  rapidly  as  t becomes  large  compared  to  t.  If  the  target  is  not 
detected  quickly,  it  will  probably  never  be  detected. 

However,  there  must  be  a limit  to  the  validity  of  Eq.(10.37).  If  the  distance  the  target  can 
move  Ut  is  smaller  than  one-half  the  swath  width,  i.e.,  W/2,  then  it  is  clear  that  if  the  search  swath 
passes  directly  (centered)  over  the  original  target  location,  then  it  must  also  pass  over  the  current 
location  of  the  target.  It  must  be  possible  for  the  detection  probability  to  be  unity  for  some  choice 
of  variables.  We  have  just  shown  that  P(co)  = 1 , if  26/ t/W  < 1 . It  is  probable  that  somewhat  larger 
values  of  2Ut/W  may  also  permit  unity  detection  probability,  but  an  exact  result  is  not  available. 

What  search  pattern  should  be  used  in  exhaustive  search?  If  nothing  is  known  about  the 
target  beforehand,  except  that  the  target  is  not  likely  to  attempt  evasion  (e.g.,  because  it  is  a fixed 
site),  then  a simple  back-and-forth  scan,  as  shown  in  Figure  10-5,  may  be  considered  optimum. 
It  provides  relatively  complete  coverage  with  minimal  oversearch  and  minimal  re-search.  If  on  the 
other  hand,  it  is  known  that  a stationary  target  has  a higher  probability  of  being  near  the  center  of 
the  search  area  than  near  an  edge,  then  a “spiral-out”  search  pattern  (as  shown  in  Figure  10-7)  is 
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Figure  10-7.  Spiral  scan  patterns.  The  spiral-out  pattern  yields  minimum  search  times 
for  targets  that  are  more  likely  near  the  center  of  a search  area.  The  spiral-in  pattern 
yields  the  maximum  detection  probability  for  maneuvering  targets. 


SPIRAL-OUT  SCAN 


SPIRAL-IN  SCAN 


preferred.  The  search  is  started  at  the  most  probable  location  and  the  area  is  covered  by  an  ever- 
growing spiral.  The  spiral  out  pattern  has  a search  rate  that  is  comparable  to  the  back-and-forth 
pattern  but  on  the  average  will  provide  earlier  detection  (the  high  probability  area  is  searched  first 
followed  by  the  lower  probability  areas).  There  are  many  instances  where  it  is  known  that  the 
probability  is  higher  that  the  target  is  near  the  center  of  the  search  pattern.  For  example,  if  a 
submarine  is  initially  detected  by  a surveillance  system  to  be  located  at  a given  location  and 
traveling  at  known  speed  in  a fixed  direction,  then  the  most  probable  location  is  the  forward 
projection  of  the  established  track.  This  will  be  the  true  position  if  the  submarine  does  not  change 
speed  or  maneuver.  The  longer  the  submarine  waits  to  maneuver,  the  closer  it  will  be  to  the 
predicted  location.  Since  a maneuver  may  be  assumed  to  be  equally  likely  in  any  interval  of  time, 
only  the  least  likely  maneuvers  (those  made  immediately  after  the  track  was  established)  can 
produce  targets  at  the  edge  of  the  uncertainty  volume  (which  also  comprises  the  largest  volume 
element).  The  submarine  is  more  likely  to  be  within  the  inner  25%  of  the  search  area  than  it  is  in 
the  outer  75%. 

If  it  is  suspected  that  the  target  will  attempt  to  evade  detection,  but  the  search  swath  and 
velocity  are  large  and  the  target  velocity  is  small,  then  a “spiral-in”  or  trapping  spiral  (also  shown 
in  Figure  10-7)  may  prove  advantageous.  The  search  pattern  circles  the  likely  uncertainty  area  and 
moves  in.  If  is  the  average  distance  from  the  center  of  the  spiral  of  one  loop  around  the  spiral, 
W is  the  swath  width,  V is  the  search  velocity,  and  U is  the  target  velocity,  the  time  required  to 
search  that  loop  of  the  spiral  is  given  by 
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(10.39) 


Ts^ItvRsIV  . 

The  maximum  radial  distance  that  the  target  can  move  in  this  time  is 

= UTs  = ItiRsU  / V . (10.40) 

If  is  smaller  than  W then  the  target  cannot  escape  from  the  ever-narrowing  spiral.  Each 
succeeding  loop  around  the  spiral  takes  less  time  than  the  preceding  loop.  Thus,  the  target  can  only 
move  a shorter  and  shorter  distance  before  the  next  loop  begins.  If  i?;;  > W then  the  target  has  a 
significant  but  not  unity  probability  of  escaping  detection.  The  spiral-in  search  pattern  will 
maximize  the  probability  of  detecting  an  evading  target.  On  the  other  hand,  if  the  target  does  not 
attempt  to  evade,  then  detection  will  not  occur  until  all  of  the  area  has  been  searched.  A maximum 
search  time  will  be  consumed.  Each  possible  search  pattern  has  advantages  and  disadvantages 
depending  on  the  behavior  of  the  target. 

The  physical  process  of  searching  an  area  is  usually  accomplished  by  combining  two  kinds 
of  motion.  The  swath  width  is  usually  generated  by  scanning  a sensor  with  a relatively  narrow  field 
of  view  in  some  back  and  forth  fashion.  This  is  then  coupled  with  platform  motion  in  some  desired 
pattern  (back-and- forth,  spiral-in,  spiral-out,  etc.)  to  accomplish  the  total  area  search.  The  scanning 
to  generate  the  swath  width  can  be  accomplished  in  several  ways.  A helical  scan  (a  simple  conical 
scan  coupled  with  forward  platform  motion)  will  produce  two  roughly  parallel  bar  scans  (one 
leading  the  motion  and  one  trailing  the  motion).  This  is  shown  in  Figure  10-7.  The  portions  of  the 
scan  parallel  to  the  platform  motion  are  often  discarded  as  there  is  considerable  re-search  which 
merely  adds  to  the  processing  burden.  This  scan  is  easily  implemented  even  relatively  large 
apertures  can  be  scanned  at  reasonable  high  rates,  but  is  very  inefficient  as  no  more  than  50%  (and 
possible  much  less)  of  the  scan  provides  useful  area  coverage.  The  rate  of  the  scan  must  be  high 
enough  that  the  ground-projected  distance  covered  by  the  field  of  view  of  the  sensor  is  greater  than 
the  distance  traveled  by  the  platform  between  scan  bars. 

Back  and  forth  line  scans  are  a more  efficient  way  to  generate  the  swath.  The  back  and  forth 
motion  usually  takes  a simple  form.  The  transverse  motion  of  the  sensor  field  of  view  may  be 
sinusoidal  in  time  or  it  may  be  linear.  Simple  linear  motion  results  in  a zigzag  or  sawtooth  scan 
(Fig.  10-7).  The  sawtooth  scan  suffers  from  the  fact  that  if  the  area  is  covered  completely  (no  gaps 
between  zigs  and  zags)  then  each  element  of  area  is  viewed  twice  (the  re-search  factor  - the  average 
number  of  times  each  element  of  search  area  is  searched  - is  roughly  2).  If  the  re-search  factor  is 
reduced,  then  the  coverage  must  be  reduced  less  than  100%.  This  can  be  ameliorated  by  using  a 
rectangular  scan  (or  bowtie  scan).  In  the  bowtie  scan  a longitudinal  motion  is  coupled  to  the 
transverse  motion.  Ignoring  platform  motion,  the  angular  motion  of  the  field  of  view  will  scan  a 
closed  pattern  that  has  a bowtie  shape  (also  shown  in  Figure  1 0-7).  When  platform  motion  is  added, 
the  search  pattern  is  transformed  into  rectangular  motion  on  the  ground  (thus  the  two  different  ways 
of  referring  to  the  same  process).  This  scan  is  more  difficult  to  implement  but  has  minimum  re- 
search and  very  little  coverage  loss. 
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Sinusoidal  scan  is  relatively  easy  to  produce  and  can  have  relative  low  re-search  and  relative 
low  coverage  loss.  The  major  drawback  to  sinusoidal  scan  is  a variable  dwell  time.  Dwell  time  is 
the  effective  time  that  the  sensor  actually  views  each  element  of  the  search  area.  The  signal-to-noise 
ratio  of  many  sensors  depends  strongly  on  the  dwell  time.  The  dwell  time  is  highest  at  the  edges  of 
the  swath  and  shortest  at  the  center  of  the  swath.  The  variation  in  sensor  performance  as  a function 
of  scan  position  is  undesirable  and  may  not  be  acceptable  from  a detection  perspective  (detection 
in  this  instance  is  not  cookie  cutter  and  requires  the  more  detailed  analysis  of  the  form  described  in 
Chapter  11). 


Figure  10-7.  Sensor  scan  patterns  commonly  employed  in  area  search. 


a)  HELICAL 


b)  SINUSOIDAL  c)  ZIG-ZAG 


d)  BOWTIE  or 
RECTANGULAR 


CX3 
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Volume  Search 


In  volume  search  the  sensor  covers  a limited  angular  area  (usually  but  not  always  out  to  a 
fixed  range)  and  attempts  to  determine  if  target  is  present  within  the  conical  volume  or  not.  Radars 
tend  to  search  in  range  as  well  as  angle;  simple  imaging  systems  search  only  in  angle  (although  a 
maximum  range  is  often  assumed).  In  every  instance,  the  angular  region  must  be  repeatedly 
searched. 

Search  of  the  angular  region  is  equivalent  to  an  area  search  except  that  it  almost  always  must 
be  completed  in  a specified  time.  The  revisit  time  is  usually  determined  from  system  considerations. 
For  example,  if  the  search  is  performed  by  track- while-scan  system  (see  Chapter  15  for  discussion 
of  tracking  systems),  then  the  search  must  be  repeated  at  a rate  that  is  consistent  with  the  tracking 
accuracy  required.  This  imposition  of  a specific  value  on  the  area  search  time  T is  a significant 
complication.  If  we  assume  a continuous  search  with  an  angular  swath  width  0,  an  angular  scan 
velocity  co,  and  a total  angular  area  Q to  be  search,  we  have  by  analogy  to  Eq.(  10.24) 

T^aiOco.  (10.41) 

Eq.(  1 0.4 1 ) is  simple  enough.  The  primary  complication  comes  from  the  need  to  dwell  on  each  point 
of  the  search  area  for  a finite  period  of  time.  The  performance  of  every  sensor  depends  on  the 
carrier- to-noise  ratio.  If  the  carrier-to-noise  ratio  goes  to  zero,  the  detection  performance  of  a sensor 
goes  to  zero.  In  every  instance,  carrier-to-noise  ratio  can  be  related  to  the  ratio  of  energy  received 
from  the  target  to  noise  energy  intrinsic  to  the  system.  Since  no  target  can  radiate  or  reflect  infinite 
power,  then  as  the  observation  time  goes  to  zero,  the  energy  received  from  the  target  must  go  to 
zero.  In  practice,  the  carrier-to-noise  ratio  will  go  to  zero  as  the  observation  time  goes  to  zero. 

Thus,  every  sensor  must  have  a minimum  dwell  time  r over  which  energy  for  a single 
observation  is  collected.  In  the  continuous  scan  can  described  above,  the  dwell  time  is  provided  by 
an  angular  width  (f)  perpendicular  to  the  swath  width.  The  dwell  time  can  be  calculated  to  be 

T^(l)lco  (10.42) 

with 

= (10.43) 

It  is  the  imposition  of  a maximum  value  on  the  scan  rate  eithat  causes  the  problems.  The  maximum 
value  for  scan  rate  means  that  there  is  a minimum  value  for  the  scan  time  T.  We  have  encountered 
one  of  those  situations  that  makes  systems  engineering  so  interesting.  It  is  desirable  to  make  T as 
small  as  possible  and  to  make  co  as  small  as  possible,  but  the  two  are  inversely  related.  The  trade 
between  the  two  has  no  unique  solution  and  any  selection  can  be  debated.  There  is  a limited  amount 
of  flexibility  available  by  increasing  ^and/or  cf),  but  sooner  or  later,  the  device  designer  will  object 
that  too  much  is  being  asked  for. 
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In  imaging  systems,  the  value  of  (j)  is  usually  assoeiated  with  the  angular  resolution  (see 
Chapter  12  for  a diseussion  of  resolution)  of  the  imaging  system.  The  value  of  6>is  usually  related 
to  the  number  of  detector  elements  in  the  array  N and  the  detector  angular  subtense  6,^ 

0^N0j,.  (10.44) 

If  the  imager  has  only  a single  detector  element  then  0 is  usually  associated  with  the  angular 
resolution  (as  is  ^). 

In  radar  systems,  the  problem  becomes  even  more  complicated.  In  a radar,  each  measure- 
ment requires  at  least  one  radar  pulse.  In  many  systems,  pulse  integration  may  cause  a large  number 
of  pulses  to  be  required  to  make  a single  measurement.  Thus,  if  the  radar  has  a pulse  repetition 
frequency  PRF  and  requires  Nj  pulses  per  measurement,  then  the  “dwell  time”  must  be 

t^Nj/  PRF  (10.45) 

and  the  maximum  scan  rate  becomes 

0)  = ^ / T = <p  PRF  / N j . (10.46) 

The  angle  (j)  is  most  often  associated  with  the  full-angle  beamwidth  of  the  radar,  although 
occasionally  it  is  associated  with  the  angular  resolution  of  the  radar  - the  actual  selection  is  usually 
a subject  of  a trade  concerning  the  effects  of  gaps  in  coverage  and  the  radial  dependence  of  the 
detection  probability. 

A second  complication  in  radar  systems  arises  from  the  requirement  that  each  radar  pulse 
must  travel  to  the  target  and  come  back  to  the  radar  before  the  radar  can  scan  to  a new  direction. 
This  requirement  imposes  a limit  on  the  PRF 

PRF HR  (10.47) 

where  R is  the  maximum  range  to  the  target  (i.e.,  the  range  associated  with  the  search  volume)  and 
c is  the  speed  of  light.  With  this  restriction,  the  maximum  scan  rate  becomes 

CO  = (f) PRF^^j^  I N j = (f)c 1 2RN j . (10.48) 

As  before  the  restriction  on  scan  rate  means  that  search  cannot  be  planned  independently  of  the 
specific  sensor  parameters  envisioned.  Trades  will  be  the  name  of  the  game  in  volume  search. 

Once  scan  rates  and  swath  sizes  have  been  determined,  the  scan  pattern  remains  to  be 
determined.  Figure  10-8  shows  some  of  the  generic  kinds  of  scan  patterns  that  are  used  to 
implement  volume  searches.  The  primary  distinction  between  them  is  how  they  are  implemented. 
For  example,  the  spiral  scan  is  a composite  of  a simple  circular  scan  (that  can  be  produced  by  a 
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Figure  10-8.  Commonly  used  volume  search  scan  patterns. 


c)  BALL  JOINT  (DOUBLE  SAWTOOTH) 


d)  LISSAJOUS  (DOUBLE  SINUSOID) 


rotating  antenna  optical  element)  and  a slow  linear  scan.  The  pattern  spirals  out  and  then  spirals  in. 
The  raster  scan  is  produced  by  combining  a fast  (horizontal  in  the  figure)  linear  scan  in  one 
dimension  and  a slow  linear  scan  in  the  orthogonal  direction.  Between  each  element  of  the  linear 
scan  there  is  a fast  “flyback”.  The  ball  joint  scan  is  a combination  of  two  simple  orthogonal 
sawtooth  (constant  velocity  back  and  forth)  scans  with  slightly  different  features.  It  can  be 
mechanically  implemented  using  a mirror  mounted  on  a rod  passing  through  a ball  joint  and  driven 
by  two  orthogonal  rotating  cams.  The  Lissajous  scan  is  named  after  the  famous  patterns  observed 
while  testing  oscilloscopes.  It  is  formed  by  combining  a vertical  sinusoidal  scan  with  a horizontal 
sinusoidal  scan  at  a slightly  different  frequency.  When  both  frequencies  are  integer  multiples  of  a 
common  frequency  then  a closed  form  pattern  (one  example  with  vertical  frequency  five  times  the 
horizontal  frequency  is  shown  in  the  figure)  will  result.  The  raster  scan  is  the  most  commonly  used 
pattern,  but  mechanical  design  concerns  have  resulted  in  all  of  these  patterns  being  used  in  one  real- 
world  system  or  another. 
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Incorporating  Effects  of  Non-Unity  Detection  Probability  into  Search 


The  preceding  sections  have  all  assumed  “cookie  cutter”  detection.  That  is,  the  detection 
probability  is  equal  to  one  everywhere  within  a finite  range  of  the  platform,  and  equal  to  zero  outside 
that  finite  range.  All  calculations  are  based  on  a search  radius  r = a,  where  a is  a characteristic 
cookie  cutter  distance),  a swath  width  {W  = 2a),  or  a beam  width  r = a or  d = 2a).  In  the  real 
world,  the  detection  probability  is  never  one.  As  we  shall  see  in  the  next  chapter,  the  detection 
probability  is  usually  a strong  function  of  the  carrier-to-noise  ratio  (CNR)  of  the  sensor  system.  As 
the  CNR  decreases,  the  detection  probability  drops  rapidly. 

For  example,  in  the  Swerling  2 radar  example  in  the  next  chapter  (Ch.ll),  the  detection 
probability  (for  10  '^  false  alarm  probability)  versus  CNR  (in  dB)  is  roughly 
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In  a radar  such  as  this,  the  CNR  is  inversely  proportional  to  the  fourth  power  of  range.  Thus,  a 1 .5 
dB  (41%)  increase  in  range  will  produce  an  6 dB  decrease  in  CNR.  Thus,  if  this  radar  were  flying 
on  a sensor  at  an  altitude  h that  produced  a detection  probability  of  90%  in  the  nadir  direction,  then 
as  this  radar  scans  from  the  nadir  out  to  an  angle  of  45°,  the  detection  probability  will  fall  from  90% 
to  a value  of  about  65%.  Scanning  out  to  an  angle  of  60°  (3  dB  increase  in  range)  would  cause 
detection  probability  to  fall  below  20%. 

This  same  radar  may  well  have  a transmitted  radar  beam  whose  transverse  intensity  profile 
varies  as  sin^(0/0g)  / (d/O^)^  where  Of^  is  a characteristic  half-angle.  The  CNR  may  be  assumed  to 
vary  as  the  square  of  this  quantity.  The  CNR  versus  angle  is 
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Assume  that  the  detection  probability  is  90%  at  Oa.  At  the  center  of  the  beam  the  CNR  is  3 dB  larger 
that  at  6q  yielding  a detection  probability  of  around  95%.  The  CNR  at  1 .5  is  4 dB  less  than  at  Og 
yielding  a detection  probability  that  is  only  about  75%.  The  CNR  at  2.0^,,  is  more  than  10  dB  less 
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than  at  yielding  a detection  probability  slightly  less  than  30%. 

The  preceding  discussion  should  immediately  raise  questions  such  as  “how  is  the  cookie 
cutter  range  determined?”  and  “how  is  the  transverse  variation  in  detection  probability  accounted 
for?”.  These  are  extraordinarily  difficult  problems  to  analyze.  However,  rough  guidelines  or 
heuristics  can  be  established.  First,  it  is  necessary  to  determine  what  total  detection  probability  is 
the  minimum  acceptable.  It  can  never  be  unity.  Typically,  values  ofO.9  to  0.99  are  acceptable.  The 
procedure  for  determining  an  acceptable  detection  probability  is  given  in  Chapter  1 1 . 

The  minimum  acceptable  value  of  detection  probability  is  typically  assigned  to  the  cookie 
cutter  parameter.  With  this  assignment,  everything  outside  the  cookie  cutter  will  have  less  than  the 
acceptable  detection  probability,  everything  inside  will  have  higher  detection  probability.  When 
used  with  the  search  examples  from  earlier  sections  in  this  chapter,  this  assignment  guarantees  that 
the  endpoint  of  the  search  will  be  at  least  the  acceptable  detection  probability.  This  approach  will 
underestimate  the  total  achievable  detection  probability  because  it  ignores  contributions  outside  the 
cookie  cutter  and  slightly  reduces  (to  the  value  at  the  edge)  contribution  from  inside  the  cookie 
cutter. 


The  dimension  of  the  swath  width  is  usually  sized  based  on  worst  case  considerations.  For 
example,  a radar  system  will  be  designed  to  provide  a desired  detection  probability  against  the 
smallest  reasonable  target  at  the  longest  required  range  under  the  worst  case  propagation  condition. 
If  any  of  these  conditions  becomes  worse  than  the  design  conditions,  then  the  radar  is  not  expected 
to  “work”,  i.e.,  produce  the  desired  detection  probability.  Given  the  radar’s  design  conditions  and 
the  desired  search  detection  probability  (which  may  not  be  the  same  as  the  radar’s  design  detection 
probability),  it  is  possible  to  determine  the  range  at  which  the  search  detection  probability  can  be 
obtained.  This  is  usually  used  to  determine  the  swath  width  of  a radar  sensor.  It  is  important  of 
course  to  take  the  geometry  into  account  (such  as  in  the  down-looking  scan  example)  and  never 
exceed  the  radar’s  detection  range. 

The  choice  of  an  effective  sensor  beamwidth  is  also  determined  using  worst  case  analysis. 
Under  the  worst  case  design  conditions,  the  “size”  of  the  beam  that  produces  the  desired  search 
detection  probability  can  be  evaluated.  This  then  is  assumed  as  the  size  of  the  cookie  cutter  in  the 
preceding  analyses. 
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Problems 


10-1.  An  exhaustive  search  is  performed  by  square  tiling  of  circular  detection  patches  at  a constant 
rate  instead  of  sweeping  a swath  at  constant  velocity.  The  circular  patches  have  radius  a and 
are  tiled  with  a center-to-center  spacing  2d.  One  circular  detection  patch  is  covered  in  each 
unit  r of  elapsed  time.  Show  that  the  probability  of  detection  as  a function  of  time  is 
approximately 


P{t)  - rj^Q^d^t  I tAj, 


where  is  given  by  Eq.(10.18).  How  much  time  does  it  take  to  cover  the  search  area? 
What  is  the  fundamental  difference  between  Eq.(  10.25)  and  the  expression  above? 

10-2.  An  immobile  target  is  known  to  be  in  the  vicinity  of  a reference  point  with  probability 
density 


1 -P/2c7^ 




where  r is  the  distance  from  the  reference  point  and  cA  is  the  variance.  An  exhaustive  search 
is  performed  with  swath  width  W and  search  velocity  V using  a spiral-out  search  pattern. 
Assuming  that  the  swath  width  W is  small  compared  to  a;  then  show  that  the  probability  of 
detection  as  a function  of  time  is  approximately 

where 

Y = WV  I2kcj\ 


What  is  the  mean  time  required  to  detect  the  target  given  this  situation? 

10-3.  A rectangular  volume  of  1 degree  vertical  extent  and  2 degree  horizontal  extent  and  15  km 
range  is  to  be  covered  by  a perfect  raster  scan.  The  search  sensor  has  a circular  coverage 
with  angular  resolution  of  0.0 1 degrees.  Derive  a generic  expression  for  the  maximum  frame 
rate  (number  of  complete  searches  per  second)  that  this  sensor  can  generate.  Evaluate  this 
expression  for  the  specific  data  provided. 
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10-4.  The  search  sensor  described  has  a raster  scan  efficiency  defined  by 


^ SCAN 


^ LineScan 


^ LineScan  ^Flyback 


where  t^^escan  is  tin^s  required  for  the  scan  to  sweep  from  one  side  of  a scan  line  to  the 
other  and  is  the  time  required  for  the  scan  to  (more  rapidly)  sweep  back  to  the  original 
side.  How  would  the  expression  derived  in  Problem  10-3  be  modified  to  account  for  the 
raster  scan  efficiency? 
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CHAPTER  11 


DETECTION 


What  is  Detection? 

Detection  is  the  set  of  processes  by  which  a system  determines  the  likely  presence  of  an 
object  of  potential  interest  within  the  “space”  being  “observed”  by  the  system.  This  definition  is 
intentionally  vague  because  there  are  many  different  ways  implement  a detection  system  using 
sensors  and  signatures  of  all  forms.  For  example,  a magnetic  influence  mine  detects  a ship  target 
when  the  nearby  presence  of  a ship  causes  the  magnetic  field  in  the  vicinity  of  the  mine  to  change 
by  more  than  a predetermined  amount.  A sonar  operator  detects  a submarine  when  the  sound  from 
the  submarine  becomes  loud  enough  relative  to  background  noise  for  the  operator  to  perceive  the 
significant  characteristic  differences.  A radar  detects  a target  when  the  amplitude  of  the  backscat- 
tered  electromagnetic  radiation  produces  an  electrical  signal  that  exceeds  a predetermined  threshold 
voltage.  A sniper  detects  the  enemy  trooper,  when  he  can  distinguish  enough  visual  characteristics 
to  discriminate  the  enemy  from  the  background.  In  every  instance,  there  is  a signal  which  is 
characteristic  of  “target”  and  a signal  characteristic  of  “background”  or  “noise”.  Detection  occurs 
when  an  individual  or  an  electronic  system  can  sufficiently  discriminate  “target”  from  “background” 
or  “noise”  to  declare  the  target  to  be  present  with  sufficient  certainty  that  the  system  should  take 
action  (detonate  the  mine,  establish  a track  on  the  target,  or  pull  the  trigger). 

We  will  differentiate  between  two  different  forms  of  detection,  because  their  applications 
and  analysis  seldom  overlap.  The  first  of  these  is  the  detection  of  the  existence  of  a basically  one- 
dimensional signal  (amplitude  versus  time)  in  the  presence  of  noise  from  any  source.  This  problem 
is  the  basis  for  the  field  of  detection  theory.  We  will  investigate  this  type  of  detection  in  consider- 
able more  detail  in  this  chapter.  It  finds  use  in  active  sensors  and  passive  sensors,  electromagnetic 
sensors  operating  from  the  radio  frequency  to  the  ultraviolet,  acoustic  sensors,  and  sensors  operating 
on  esoteric  observables  such  as  field  anomalies.  As  we  shall  see,  a common  formalism  exists  which 
permits  accurate  modeling  of  signal  detection  phenomena  regardless  of  sensor  type. 

The  second  kind  of  detection  is  that  based  on  human  visual  observation  of  an  image.  Image 
signal  characteristics  other  than  amplitude  play  a part  in  human  perception.  For  example,  contrast 
(brightness  difference  between  target  and  background)  is  more  important  than  total  brightness. 
Shape  and  surface  texture  may  permit  detection  when  contrast  is  extremely  poor.  The  process  of 
image -based  detection  is  complicated  by  the  presence  of  clutter  (real  but  uninteresting  objects), 
finite  resolutions  of  the  images,  and  motions  of  targets,  backgrounds,  or  both.  Image-based 
detection  is  part  of  a larger  psychophysical  problem  of  image  understanding.  Because  image-based 
detection  differs  so  strongly  from  signal-based  detection,  we  will  cover  it  in  a separate  chapter  in 
this  volume  (Chapter  14). 
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Basics  of  Detection  Theory 

In  the  following  few  pages  we  will  investigate  the  detection  of  a radar  signal  in  the  presence 
of  radar  receiver  noise.[l]  A radar  was  chosen  to  illustrate  this  process  for  several  reasons.  First, 
a concrete  example  is  more  satisfying  than  a purely  hypothetical  one.  Second,  deriving  all  of  the 
expressions  for  a radar  at  this  time  will  spare  us  from  having  to  do  so  when  the  chapters  on  radars 
are  reached.  Thirdly,  the  rapid  pace  of  development  of  radar  in  World  War  II  let  hardware  design 
exceed  the  limits  of  existing  theory.  As  part  of  an  attempt  to  keep  pace,  the  field  of  detection  theory 
was  invented  and  almost  universally  applied  first  to  radar  systems.  The  process  described  here 
closely  mirrors  the  historical  development  of  the  field  using  the  examples  that  drove  that  develop- 
ment. 


The  fundamental  question  facing  any  signal-detection  system  is  whether  or  not  a target  is 
present  at  any  given  time.  If  we  simplify  the  problem  to  its  most  basic  form,  we  find  there  are  two 
hypotheses: 

Hypothesis  0 (Hq):  Target  Absent 

Hypothesis  1 (Hj):  Target  Present 

There  are  no  other  possibilities.  A target  cannot  be  both  present  and  absent.  If  multiple  targets  are 
present,  then  Hypothesis  1 is  true  by  default.  We  cannot  ask  if  a particular  kind  of  target  is  present. 
This  goes  beyond  mere  detection.  Thus,  a radar  detection  system  is  tasked  to  decide  which 
hypothesis,  Hg  or  Hj  is  true  based  on  processing  of  the  radar  return. 

The  Neyman-Pearson  approach  to  binary  hypothesis  testing  is  to  maximize  the  detection 
probability 

- Pp  is  the  probability  that  Hj  is  chosen,  given  that  Hj  is  true  - 

subject  to  the  constraint  of  keeping  the  false  alarm  probability 

- Pp  is  the  probability  that  Hj  is  chosen,  given  that  Hq  is  true  - 

below  some  preselected  value  s.  Neyman-Pearson  is  not  the  only  approach  that  could  have  been 
taken.  For  example,  (a)  one  could  minimize  Pp  subject  to  the  constraint  of  keeping  P^  larger  than 
some  value  i^or  (b)  one  could  attempt  to  maximize  Pp,  subject  to  the  constraint  of  keeping  the  ratio 
Pp  /Pp,  less  than  some  value  r\ . However,  the  first  alternate  approach  put  less  importance  on  the  basic 
detection  function  and  tend  to  give  less  than  optimum  detection  performance.  The  second  alternate 
approach  put  less  emphasis  on  false  alarms;  although  detection  probability  may  be  enhanced,  it  is 
more  difficult  to  control  false  alarms.  In  practice,  the  Neyman-Pearson  approach  has  been  deemed 
to  be  optimum  for  detection  problems. 

The  optimum  Neyman-Pearson  processor  for  a continuous  variable  V (such  as  the  voltage 
output  from  a radar  video  detector)  is  a likelihood  ratio  (threshold)  test: 


> 

Choose  Hi, 

(11.1a) 

Vt 

< 

Choose  Hq, 

(11.1b) 

where  Vj-  is  chosen  to  yield  Pp  = s. 
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What  does  this  test  mean  for  detection?  If  the  signal-plus-noise  voltage  (i.e.,  the  total  output 
of  the  detector)  exceeds  the  threshold  voltage,  the  system  will  declare  a target  to  be  present  - 
whether  or  not  a target  is  really  present.  If  the  target  is  really  present,  we  have  a valid  detection. 
If  the  target  is  actually  absent,  we  have  a false  alarm.  If  the  signal-plus-noise  voltage  fails  to  exceed 
the  threshold,  the  system  declares  that  no  target  is  present  - whether  or  not  the  target  is  really  absent. 
The  question  should  be  asked  as  to  why  one  can  have  a threshold  crossing  in  the  absence  of  a target. 
The  answer  lies  in  the  character  of  the  system  noise. 

Let  us  consider  the  case  where  no  target  is  present.  The  output  of  the  receiver  is  due  entirely 
to  system  noise.  Such  noise  is  generated  internally  in  every  electrical  system.  There  is  no  way  to 
completely  eliminate  system  noise.  One  of  the  characteristics  of  noise  is  that  it  tends  fluctuate  in 
amplitude  about  an  average  value,  as  shown  in  Figure  11-1.  However,  occasionally,  the  amplitude 
will  fall  well  below  average,  and  on  other  occasions,  the  amplitude  may  spike  well  above  average. 
The  higher  the  amplitude  of  the  spike,  the  less  likely  it  is  to  occur.  Nevertheless,  there  is  a non-zero 
probability  that  a spike  of  any  magnitude  can  be  produced. 


Figure  11-1.  Typical  behavior  of  system  noise.  Vq  is  the  rms  noise  level;  is  a threshold. 


ENVELOPE  OF 
NOISE  VOLTAGE 


The  fraction  of  any  long  time  interval  that  the  noise  voltage  lies  above  the  threshold  voltage 
can  be  equated  to  the  false  alarm  probability.  The  higher  the  threshold,  the  smaller  the  fraction  of 
time  that  it  will  be  exceeded,  and  the  lower  the  false  alarm  probability.  For  classical  radar  receivers 
the  system  noise  has  been  characterized  by  the  following  probability  density  function 

= (11.2) 

Equation  (11.2)  results  when  Gaussian  noise  is  narrow-band  filtered.  The  rms  noise  Fp  is 
determined  by  the  system  temperature  and  the  filter  bandwidth  among  other  things. 

By  way  of  a quick  review,  the  probability  density  function  is  that  function  of  a random 
variable  whose  definite  integral  over  an  interval  gives  the  probability  that  the  variable  lies  within 
that  interval,  i.e. 

J dx  Px{x)  = Probability  that  a > x > b . (11-3) 
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Thus  we  would  expect  that  the  false  alarm  probability  can  be  directly  determined  by  definite 
integration  of  the  noise  probability  density  function  for  the  interval  lying  above  the  threshold.  Thus, 
we  find 


PF  = ldVp„{V). 


(11.4) 


Equation  (1 1.4)  is  a general  expression.  It  will  be  valid  for  any  and  every  sensor  system  provided 
we  know  the  form  of  the  appropriate  Pj/V)  function.  For  the  case  of  radar  noise,  the  integration 
gives  a straightforward  closed-form  result  (this  does  not  happen  for  every  possible  noise  density) 


Pp  = 


(11.5) 


These  results  are  shown  graphically  in  Figure  11-2.  The  total  area  under  the  curve  pJV)  is  unity. 
The  shaded  area  (the  area  lying  above  the  threshold  voltage)  is  equal  to  the  false  alarm  probability. 


Figure  11-2.  Graphical  representation  of  false  alarm  probability  for  a radar. 


Pn'V) 


If  the  false  alarm  probability  of  a system  is  very  low,  then  a considerable  amount  of  time  may 
pass  between  false  alarms.  In  more  than  a few  systems,  it  is  convenient  to  specify  an  average  time 
between  false  alarms,  rather  than  specifying  a false  alarm  probability.  For  example,  requiring  that 
a radar  should  have  less  than  one  false  alarm  per  hour  is  more  meaningful  than  requiring  that  radar 
to  have  a false  alarm  probability  less  than  10  ",  even  though  the  two  requirements  may  be 
equivalent.  If  we  define  Tp  to  be  the  mean  time  between  false  alarms  and  we  define  B to  be  the 
bandwidth  of  the  radar  that  determines  its  electrical  noise  characteristics  (usually  the  bandwidth  of 
a filter  in  the  IF  section  of  the  receiver),  then  the  equivalent  false  alarm  probability  can  be 
determined  from  the  relation 

Pp  = HTpB  (11.6) 

For  example,  let  us  assume  that  a radar  is  desired  to  have  a mean  time  between  false  alarms  of  one 
hour  and  that  the  radar  has  an  IF  limiting  bandwidth  of  10  MHz,  then 
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Pp  = l/[(3600  seconds)  x (10,000,000  Hz)]  = 2.777  x lO'". 


If  a target  is  present,  then  the  voltage  produced  by  its  signal  adds  to  the  system  noise  voltage. 
The  probability  density  function  is  altered.  If  the  target  produces  a constant  signal  of  voltage  and 

the  radar  possesses  narrowband  Gaussian  noise,  the  probability  density  for  signal-plus-noise 
becomes 


Ps.«{'')  = (VIV«)e 


{v^  + A^)l2Vi 

^0 


{VAIV^) 


(11.7) 


where  Ig(x)  is  a Bessel  function  of  purely  imaginary  argument.  Equation  (1 1.7)  is  only  valid  when 
the  target  is  present.  Thus,  the  integral  of  the  probability  density  above  threshold  should  be  equal 
to  the  probability  of  detection 


Po=\ldV  p,^,{V) 


(11.8) 


Equation  (1 1 .8)  is  valid  for  any  sensor  system  if  the  correct  function  (not  necessarily  Eq.  (1 1 .7))  for 
Ps+n(V^)  is  used.  Unfortunately,  the  integral  of  Eq.  (11.8)  using  the  function  in  Eq.  (11 .7),  cannot  be 
evaluated  in  closed  form.  The  integral  function  defined  in  Equation  (1 1 .9)  is  called  Marcum’s  Q- 
function.  It  is  relatively  easily  evaluated  numerically. 


Q^{A,r„;V,)=rdV  (11.9) 


Figure  11-3  graphically  illustrates  the  probability  of  detection  and  compares  it  to  the  false  alarm 
probability. 


Figure  11-3.  Graphical  representation  of  detection  probability  of  a radar. 
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Figure  11-3  gives  us  the  first  hint  of  the  contradictory  (or  perhaps  complementary)  natures 
of  Pp  and  Pp.  Detection  probability  can  be  increased  (which  is  good)  by  lowering  the  threshold. 
However,  lowering  the  threshold  means  increasing  the  false  alarm  probability  (which  is  bad). 
Before  addressing  this  point  further,  it  is  important  to  introduce  some  new  terminology.  One  term 
is  familiar  to  everyone,  but  understood  by  few;  the  other  term  is  unfamiliar  to  most  people. 

The  output  of  the  receiver  is  a complex  function  r which  has  two  contributors:  a signal 
component  j;  and  a receiver  noise  component  n,  [2] 

H = y + n (11.10) 


The  signal-to-noise  ratio  SNR  is  a quantity  that  tells  us  how  much  information  a signal  can  convey. 
It  is  defined  by  the  following  expression 


SNR  = 


(Mean  Square  Signal  Power) 
(Mean  Square  Fluctuation  Level) 


(11.11) 


Everyone  thinks  they  know  what  signal-to-noise  ratio  means.  However,  most  authors  define  the 
SNR  as  the  signal  power  divided  by  the  noise  power.  The  information  theory  definition  recognizes 
that  random  fluctuations  in  signal  level  can  occur  in  addition  to  random  fluctuations  in  noise  level. 
Both  kinds  of  fluctuations  limit  the  ability  to  extract  information.  The  term  that  most  authors  call 
signal-to-noise  ratio  is  actually  the  carrier-to-noise  ratio  defined  below 


CNR  = 


(Mean  Signal  Power) 
(Mean  Noise  Power) 


(11.12) 


It  is  the  CNR  that  is  calculated  by  the  radar  equation,  not  the  SNR  (although  few  radar  texts  use  this 
more  modem  terminology).  We  are  attempting  to  give  the  system  engineer  a new  tool  that  he  can 
use  to  analyze  every  kind  of  sensor  from  a common  perspective.  In  other  sections  of  this  work  we 
will  need  to  distinguish  between  the  two  quantities.  For  that  reason  we  introduce  them  both  at  this 
time.  In  almost  every  instance  that  the  reader  will  find  in  outside  sources,  he  should  feel  free  to 
substitute  CNR  for  SNR  and  they  would  be  consistent  with  the  present  work.  The  author  has  made 
every  attempt  to  be  100%  consistent  in  his  use  of  CNR  vs  SNR,  however,  there  may  be  occasional 
mistakes  in  not  altering  expressions  obtained  from  others. 

For  radar  systems  it  is  possible  to  model  the  noise  n as  a zero-mean,  complex,  Gaussian 
random  variable  with  unit  variance.  [2]  Using  this  normalization  we  have 
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(11.13) 


(11.14) 


for  the  normalized  mean  and  varianee  of  the  output  of  the  radar  matched  filter  detector.  The  factor 
of  unity  in  Eq.  (11.14)  corresponds  to  a target-independent,  constant  term  due  to  receiver  noise. 
Combining  Eqs.  (11.11)  and  (11.14)  yields 


SNR  = 

varl 


+ 1 


(11.15) 


Substitution  of  the  definition  of  CNR  into  this  expression  yields 


SNR  = 


CNR  1 2 


\ + CNR 


+ {icnrY 


(11.16) 


If  we  consider  only  the  situation  where  CNR  > 5,  it  is  possible  to  neglect  the  {2CNR)'^  term  in  the 
denominator  of  Eq.  (11.16).  If  we  further  define  a quantity 


SNR,,, 


(11.17) 


Eq.  (11.16)  can  be  written  in  a universal  form 


SNR 


CNR 


2SNR 


SAT 


1 + 


CNR 


2SNR 


SAT 


(11.18) 
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This  is  plotted  in  Figure  11-4.  At  very  large  CNR,  SNR  asymptotically  approaches  SNRs^’p.  At 
smaller  values  of  CNR,  SNR  is  approximately  CNR/2.  Thus  we  see  that  SNRg^j.  is  the  largest  SNR 
that  can  be  achieved  for  any  CNR.  It  should  be  noted  that  for  Swerling  II  cross  section  statistics  (see 
section  on  target  fluctuations)  SNR^^j.  = 1. 


Figure  11-4.  Universal  curve  for  normalized  signal- to-noise  ratio 
versus  normalized  carrier-to-noise  ratio. 


CNR/SNRsat 


In  our  radar  example  above,  the  carrier-to-noise  ratio  can  be  easily  calculated  to  be 

CNR  = !V^.  (11.19) 

The  signal-to-noise  ratio  is  given  by 

SNR  = nvl  = CNR! 2.  (11.20) 

A thorough  examination  of  Equations  (11.5)  and  (11.9)  indicates  a complex  interrelationship 
between  detection  probability,  false  alarm  probability,  and  carrier-to-noise  ratio.  Consider  the 
functions  plotted  in  Figure  11-5.  Note:  the  curve  for  CNR=0  is  the  same  as  the  curve  for  noise 
alone. 
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Figure  11-5.  Graphical  representation  of  the  effects  of  changing  CNR  on  Ps+n(V). 


p(V) 


As  CNR  increases,  the  peak  of  the  probability  density  function  shifts  to  higher  and  higher  values. 
Consequently,  for  any  fixed  threshold,  the  probability  of  detection  increases  with  increasing  CNR. 
The  false  alarm  probability  is  unaffected  by  CNR  changes.  As  the  threshold  is  increased  (for  fixed 
CNR),  the  false  alarm  probability  decreases,  but  the  detection  probability  also  decreases. 
Conversely,  lowering  the  threshold  to  increase  the  probability  of  detection  produces  an  accompany- 
ing increase  in  the  false  alarm  probability.  To  obtain  an  increase  in  probability  of  detection  and  a 
decrease  in  false  alarm  probability,  it  is  necessary  to  increase  the  threshold  while  simultaneously 
increasing  the  CNR  even  faster. 

These  complex  interrelationships  between  Pp,  Pp,  and  CNR  are  summarized  in  the  receiver 
operating  characteristic  (or  ROC).  The  ROC  may  take  different  forms.  However,  one  of  the  more 
useful  forms  is  a plot  of  P^  vs.  CNR,  parameterized  in  Pp.  For  the  radar  example  we  have  been 
following,  the  ROC  is  shown  in  Figure  11-6.  The  ROC  is  a key  tool  in  the  sensor  engineer’s 
toolbox.  It  is  used  in  the  following  fashion.  From  higher  level  system  specifications,  the  designer 
will  have  been  given  acceptable  levels  of  false  alarms  and  required  detection  probabilities.  Given 
P]3  and  Pp(or  Tp),  he  then  finds  the  appropriate  curve  on  the  chart  and  determines  what  value  of  CNR 
is  required  to  yield  the  required  Pp  & Pp  performance.  The  job  of  the  sensor  designer  then  becomes 
the  design  of  the  sensor  to  yield  that  required  CNR  or  better  under  all  operational  conditions  of 
interest.  For  the  ROC  shown  in  the  figure,  if  Pp  > 0.90  and  Pp  < 10'^  are  required  to  achieve 
acceptable  system  performance,  then  a CNR  > 14.2  dB  is  required.  As  indicated  in  the  chapter  in 
radiometry  and  as  will  be  explicitly  derived  in  the  chapter  on  radar,  a “radar  range  equation”  can  be 
defined  which  relates  CNR  to  explicit  radar  parameters  such  as  power,  antenna  size,  and  frequency 
and  to  operational  parameters  such  as  target  range  and  target  radar  cross  section.  Given  the  required 
CNR,  the  radar  designer  can  use  the  range  equation  to  perform  trade  studies  to  determine  the 
optimum  choice  of  radar  parameters. 
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Figure  11-6.  Receiver  operating  characteristic  for  a radar  with  a non- fluctuating  target. 
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Impact  of  Signal  Fluctuations  on  Detection 

The  ROC  generated  in  the  preceding  section  explicitly  assumed  a target  with  a constant 
signal  level  A.  Unfortunately,  the  returns  from  real  target  are  seldom  constant  in  amplitude.  For  a 
variety  of  reasons,  the  target  returns  usually  fluctuate  violently,  even  if  the  radar  illumination  is 
constant.  Possible  significant  contributors  to  signal  level  fluctuations  include: 

* Misalignment  of  the  radar  beam  with  respect  to  the  target  (due  to  beam  scanning, 
pointing  jitter,  target  dynamics,  or  tracking  errors) 

* Changes  in  target  orientation 

* Constructive  and  destructive  interference  between  reflections  from  different  parts 
of  the  targets 

* Multipath  interference  (between  direct  and  indirect  paths  between  the  radar  and 
the  target) 

* Temporal  and/or  spatial  variations  in  atmospheric  absorption 

* Temporal  and/or  spatial  variations  in  atmospheric  scattering  (e.g.,  from  dust, 
clouds,  smoke,  or  fog) 

* Presence  of  precipitation  (rain,  snow,  hail,  etc.) 

* Atmospheric  turbulence 

* Ionospheric  Scintillation 

* Jamming 

In  general,  most  fluctuations  can  be  classified  as  being  either  target- induced  or  propagation  channel- 
induced. 

The  presence  of  signal  fluctuations  significantly  alters  the  probability  density  of  the  signal- 
plus-noise.  In  turn,  the  receiver  operating  characteristic  will  be  correspondingly  altered.  In  many 
instances,  the  fluctuation  statistics  can  be  adequately  modeled  or  measured.  If  this  is  possible,  then 
modified  receiver  operating  characteristics  can  be  constructed. 

If  we  consider  target-induced  fluctuations,  then  any  measured  signal  strength  (CNR)  can  be 
related  to  an  effective  radar  cross  section  (a).  In  turn  any  signal  voltage  level  A can  be  related  to 
a square  root  of  that  effective  cross  section.  Fluctuations  in  A can  be  considered  to  be  equivalent 
to  fluctuations  in  a.  That  is, 

CNR  cc  a =>  A cc  (11-21) 

For  convenience  let  us  rewrite  Marcum’s  Q function  (the  detection  probability  for  non- fluctuating 
signals  - Eq.  (10.9))  in  the  following  fashion  where  we  have  highlighted  the  implicit  dependence 
on  cross  section  (A  = aN^^) 

Qm(o-,V„;  yT)=i  dV  (11.22) 

^ V j' 

If  the  probability  of  seeing  any  given  value  of  effective  cross  section  is  p((j),  then  if  we  average 
Marcum’s  Q function  over  this  distribution  of  possible  cross  section  values  we  should  obtain  the 
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detection  probability  averaged  over  the  effective  cross  section  fluctuations 


^z)  = Jo  (11-23) 

The  trick  is  to  obtain  the  statistics  for  the  cross  section  fluctuations. 

Peter  Swerling  did  just  that  for  several  model  radar  targets.  [3]  These  models  are  summarized 
in  Table  11-1.  His  first  case  consisted  of  a large  number  of  small  reflectors  of  roughly  equal  size, 
spaced  randomly  over  a volume  large  compared  to  a wavelength  of  the  radar  being  considered. 
Interference  will  occur  between  each  reflector  and  every  other  reflector.  The  net  result  of  all  the 
constructive  and  destructive  interferences  was  an  effective  cross  section  that  was  Rayleigh 
distributed. 


Table  11-1.  Swerling’s  models  of  radar  cross  section  fluctuations.  [3] 


SWERLING 

CASE 

CROSS  SECTION 
DISTRIBUTION 

CORRELATION 

TIME 

SCAN-TO-SCAN 

BEHAVIOR 

PULSE-TO-PULSE 

BEHAVIOR 

'O' 

DIRAC  DELTA  - 
ONE  STEADY  REFLECTOR 

INFINITE 

COHERENT 

COHERENT 

1 

RAYLEIGH  - 
NUMEROUS  (>5) 
RANDOM  REFLECTORS 

LONG 

NONCOHERENT 

COHERENT 

II 

SHORT 

NONCOHERENT 

NONCOHERENT 

III 

RICE  - 

ONE  LARGE  STEADY  & 
NUMEROUS  SMALL 

LONG 

NONCOHERENT 

COHERENT 

IV 

RANDOM  REFLECTORS 

SHORT 

NONCOHERENT 

NONCOHERENT 

That  is,  the  cross  section  has  the  following  probability  distribution 

Prayleigh{^)  ~ ^ (11.24) 

<^AV 

where  is  the  average  (mean)  value  observed  for  the  effective  cross  section.  In  Swerling’s  later 
calculations  he  also  assumed  different  correlation  times  for  the  cross  section.  The  correlation  time 
is  essentially  the  time  over  which  the  cross  section  can  be  assumed  to  be  constant.  If  one  value  of 
cross  section  is  observed  at  time  T,  then  if  a second  observation  is  made  a time  r later,  the  two 
values  will  be  the  same  if  the  time  difference  ris  small  compared  to  the  correlation  time.  If  ris 
large  compared  to  the  correlation  time,  then  a different  value  of  cross  section,  randomly  selected 
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from  the  cross  section  distribution  will  be  observed.  Swerling  had  two  time  scales  of  interest:  the 
time  between  successive  scans  of  the  radar  (usually  seconds)  and  the  time  between  successive  pulses 
from  the  radar  (usually  fractions  of  a millisecond).  For  computational  purposes,  he  assumed  one 
collection  of  random  scatterers  had  a long  correlation  time  (larger  than  milliseconds  but  smaller  than 
seconds).  Such  a model  would  exhibit  a difference  cross  section  every  scan  of  the  radar,  but  would 
have  a constant  cross  section  pulse  to  pulse  within  a scan.  He  called  this  Case  I.  He  also  assumed 
a second  collection  of  random  scatterers  would  have  a short  correlation  time  (smaller  than  a fraction 
of  a millisecond).  Such  a model  would  exhibit  a different  cross  section  for  every  radar  pulse.  He 
called  the  short  correlation  time  model  Case  II. 

Swerling  put  forward  a second  model  which  was  composed  of  one  large  steady  (non- 
fluctuating) scatterer  plus  numerous  small  random  reflectors  whose  integrated  return  was  compara- 
ble to  that  of  the  large  single  scatterer.  This  model  produces  a Rice  distribution  for  the  cross  section 

Price((^)  = . (11.25) 

^AV 

Once  again  he  proposed  two  correlation  time  behaviors  and  called  the  respective  instances  Case  III 
and  Case  IV.  For  completeness,  this  author  has  called  the  non-fluctuating  target  as  Swerling  Case 
“0",  although  this  was  not  Swerling’s  usage.  The  cross  section  distribution  for  a non-fluctuating 
target  is  a Dirac  delta  function 


P DIRAci,^^  ~ Av\  (11.26) 

Swerling  took  his  models  and  basically  performed  the  averaging  described  above.  What  he 
found  was  that  fluctuating  targets  required  higher  CNR  to  detect  than  did  non-fluctuating  targets. 
The  higher  the  desired  detection  probability,  the  larger  the  increase  in  CNR  required  to  yield  that 
detection  probability.  However,  the  results  were  virtually  independent  of  false  alarm  probability. 
This  is  summarized  graphically  in  Figure  11-7.  Figure  1 1-7  is  used  in  the  following  fashion.  After 
determining  (from  Figure  11-6)  the  CNR  required  to  yield  desired  detection  and  false  alarm 
probabilities  for  a non- fluctuating  target,  the  additional  CNR  required  to  detect  a fluctuating  target 
is  determined  for  the  same  detection  probability  from  the  curve  for  the  appropriate  Swerling  Case. 
For  example,  if  Pd=  0.9  and  Pp  = 10'^  is  desired  for  Swerling  II,  then  from  Figure  1 1-6  we  require 
CNR  = 14.2  dB  for  the  non- fluctuating  case  plus  8 dB  from  Figure  11-7  to  accommodate  the 
Swerling  II  fluctuations  for  a total  of  22.2  dB.  If  the  radar  can  provide  22.2  dB  CNR  then  the  desired 
detection  and  false  alarm  probabilities  will  be  achieved. 

It  often  transpires  that  Swerling  II  is  a good  worst  case  model  for  fluctuations  of  real  targets. 
For  this  reason  it  is  useful  to  have  an  explicit  receiver  operating  characteristic  for  this  case.  One  can 
be  generated  in  either  of  two  ways.  First,  the  values  of  ACNR,  the  additional  CNR  required  to 
accommodate  the  Swerling  II  fluctuations,  can  be  extracted  from  Figure  11-7  and  added  to  each 
curve  in  Figure  11-6.  The  second  way  is  to  explicitly  evaluate  Equations  (11 .23)  and  (11 .24).  For 
the  case  of  Rayleigh  cross  section  statistics  only,  the  equations  can  be  solved  to  yield  the  simple 
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ADDITIONAL  SINGLE-PULSE  CNR 
REQUIRED  FOR  DETECTION  (dB) 


closed  form  solution 


(l/(l+C7Vt?)) 


(11.27) 


Either  way,  the  eurves  of  Figure  11-8  result.  This  is  the  reeeiver  operating  charaeteristic  for 
Swerling  II  radar  eross  section  statistics. 


Figure  11-7.  Additional  CNR  required  to  aecommodate  target  fluctuations. 
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Figure  11-8.  Receiver  operating  characteristic  for  Swerling  II  statistics. 
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Impact  of  Integration  on  Detection 

The  presence  of  fluctuation  causes  significant  increases  in  the  required  carrier-to-noise  ratio 
to  make  a detection.  The  additional  10-20  dB  required  may  prove  to  be  the  difference  between  a 
practical  system  and  an  impractical  system.  As  we  will  see,  increased  CNR  can  only  be  obtained 
at  the  expense  of  increased  transmitter  power,  increased  aperture  size,  or  decreased  useful  range. 
Few  systems  can  afford  any  of  these  changes.  Therefore,  it  is  of  critical  importance  to  investigate 
approaches  that  may  alleviate  this  potential  problem.  Pulse  integration  is  arguably  the  most 
important. 

When  the  signal  fluctuations  result  from  interference  processes  such  as  in  the  Swerling 
models,  the  phases  of  each  target  return  will  be  random  from  return  to  return.  The  optimum 
processing  strategy  is  to  non-coherently  integrate  N pulses.  Specifically,  the  returns  from  N pulses 
are  added  and  normalized  by  the  number  of  pulses,  and  then  compared  to  a threshold  that  is 
optimized  for  the  N-pulse-integrated  return. 


N 


m=l 


> 


Yn 


< Yn 


Choose  Hj 
Choose  Hq 


(11.28) 


There  is  a computationally  intensive  process  involving  moment  generating  functions  that  permits 
evaluation  of  detection  and  false  alarm  probabilities  in  pulse-integrated  systems.  The  mathematics 
is  beyond  the  scope  of  this  text.  However,  a useful  summary  of  results  is  shown  in  Figure  11-9. 


Figure  11-9.  Integration  improvement  factors  for  N-pulse  noncoherent  integration. 
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The  integration  improvement  factor  is  the  amount  by  which  the  single-pulse  CNR  can  be 
reduced  if  N-pulse  optimum  integration  is  used.  In  our  earlier  example  with  Swerling  II  statistics, 
Pp  = 0.9,  and  Pp  = 10'^,  a single  pulse  CNR  of  roughly  22  dB  was  required.  From  Figure  1 1-9  we 
observe  that  for  Pp,  = 0.9  and  Swerling  II,  then  the  integration  improvement  factor  for  N=  10  pulses 
is  15  dB.  This  means  that  if  ten  pulses,  each  yielding  a CNR  of  only  7 dB  (=  22  dB  - 15  dB),  are 
integrated  then  the  desired  Pp,  and  Pp  will  be  obtained. 

It  is  interesting  to  note  that  the  integration  improvement  factor  can  yield  a numerical 
improvement  that  is  larger  than  the  numerical  value  of  the  number  of  pulse  that  are  integrated.  This 
is  surprising  because  in  statistics  it  is  often  true  that  standard  deviations  decrease  by  the  inverse 
square  root  of  the  number  of  samples  obtained  (the  variances  decrease  by  one  over  the  number  of 
samples).  However,  consider  the  exact  nature  of  the  integration  process.  N samples  of  the  signal 
are  averaged  andN  samples  of  the  noise  are  averaged.  The  averaged  signal  divided  by  the  averaged 
noise  might  produce  a factor  of  1/N.  However,  because  the  noise  is  reduced,  the  threshold  can  be 
reduced  while  maintaining  the  same  false  alarm  probability.  If  noise  fluctuations  are  decreased,  the 
threshold  must  be  reduced  if  the  same  rate  of  exceedance  is  to  be  produced.  However,  reducing  the 
threshold  to  maintain  the  desired  false  alarm  probability  has  the  beneficial  effect  of  increasing  the 
detection  probability.  If  a fixed  detection  probability  is  acceptable  then  the  CNR  can  be  further 
decreased.  It  is  the  combination  of  averaging  out  signal  fluctuations,  averaging  out  noise 
fluctuations,  and  reducing  the  threshold,  that  gives  integration  improvement  factors  that  may  be 
greater  than  N. 

Pulse  integration  is  not  free  of  costs  or  penalties.  The  integrator  requires  considerable 
memory  and  sophisticated  arithmetic  computation  units  and  is  considerably  more  complicated  than 
a simple  threshold  comparator.  Thus  dollar  costs  will  increase.  Each  measurement  consumes  the 
time  required  to  gather  N pulses.  The  time  required  to  search  a specified  angular  field  of  view  will 
increase  proportional  to  the  number  of  pulses  integrated.  In  summary,  integration  increases  cost  and 
decreases  search  rate  in  exchange  for  permitting  reduced  power  and  aperture  and/or  increased  range. 

As  mentioned  above,  optimum  pulse  integration  requires  a complicated  processor.  There 
are  two  sub-optimal  integration  approaches  that  have  considerably  simpler  processors.  One-out-of- 
N pulse  detection  involves  transmitting  N pulses  and  declaring  a detection  if  any  of  the  N pulses 
exceeds  the  single-pulse  threshold.  If  Pf(1)  is  the  single -pulse  false  alarm  probability  and  Pd(I)  is 
the  single-pulse  detection  probability,  then  the  probability  of  having  1 false  alarm  in  N tries  Pp(N) 
is  given  by 

P^(V)  = l-[l-/>,(l)f  (11.29) 


while  the  probability  of  having  1 detection  in  N tries  Pd(N)  is  given  by 


P„(V)=l-[l-P„(l)f.  (11.30) 

If  Pp(N)  and  Pd(N)  are  specified,  the  values  of  Pf(1)  and  Pd(1)  required  to  produce  them  can  be 
determined  from 
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(11.31) 

i-p,(v)r. 

(11.32) 

For  Swerling  II  targets,  we  know  that 


Thus,  we  can  calculate  the  CNR  required  to  yield  specific  Pd(N)  and  Pf(N)  from  the  relation 


In 


CNR 


REQUIRED 


In 


i-[i-p,(iv)r 


4 - 1. 


(11.34) 


One-out-of-N  pulse  detection  is  simple  but  does  not  produce  anywhere  near  optimum 
performance.  A much  improved  approach  that  retains  most  of  the  simplicity  is  M-out-of-N  pulse 
detection.  In  this  approach,  N pulses  are  transmitted  and  the  number  of  threshold  crossings  is 
counted.  If  the  number  of  threshold  crossings  exceeds  a predetermined  number,  then  a detection  is 
declared.  If  fewer  threshold  crossings  are  detected,  then  no  declaration  is  made;  any  threshold 
crossings  are  assumed  to  have  been  due  to  noise. 

If  we  use  the  same  definitions  for  Pd(1)  and  Pf(1)  as  before,  then  the  probabilities  for  M-out- 
of-N  detection  can  be  determined  from  simple  probability  theory 


N 


P^(MofN)=  X 


N\ 


m=M 


m\  i^N -m) 


[p,(i)]"[i-p,(i)] 


N-m 


(11.35) 


PAMofN)=  ± 


—M  m\{N-m)\ 


(11.36) 


For  any  specific  problem,  one  might  simply  evaluate  Eqs.  (1 1.35)  and  (1 1.36)  using  a computer  for 
a variety  of  M and  N and  Px(l)  to  create  a table.  Once  the  desired  values  of  Pp(M  of  N)  and  Pd(M 
of  N)  are  established,  then  the  required  Pd(1)  and  Pp(l)  are  found  by  interpolation.  These  are 
converted  to  CNR  requirements  using  the  single-pulse  results  of  the  preceding  section. 

For  any  value  of  N,  there  is  an  optimum  value  of  M.  Use  of  non-optimum  M values  will 
require  significantly  larger  CNR  values  to  achieve  the  desired  probabilities.  Figure  11-10  shows  the 
optimum  value  of  M for  each  value  of  N.  For  large  N,  the  optimum  value  of  M is  approximately 
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N/4.  Figure  11-11  compares  the  performance  of  M-of-N  detection  to  the  optimum  pulse  integration 
scheme.  M-of-N  detection  is  seen  to  suffer  less  than  a 2 dB  penalty  relative  to  the  optimum  scheme. 


Figure  11-10.  Optimum  choice  of  M given  a specified  value  of  N. 


Figure  11-11.  Comparison  of  three  specific  M-of-N  integration  schemes  with 
optimum  pulse  integration  and  the  optimum  M-of-N  integration  scheme. 
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Example:  Turbulence  Effects  on  Laser  Radar  Detection 

In  Chapter  4 the  effects  of  atmospheric  turbulence  on  sensor  performance  were  hinted  at  but 
could  not  be  quantitatively  discussed.  Having  studied  the  effect  of  fluctuations  on  detection 
probability  we  are  now  in  a position  to  quantitatively  discuss  some  turbulence  effects  on  sensor 
performance.  Turbulent  scintillation  causes  amplitude  fluctuations  of  magnitude 


(N 

o 

II 

(11.37) 

where  % is  the  log-amplitude.  The  log-amplitude  is  described  by  a normal  (Gaussian)  distribution 
with  a mean  equal  to  minus  the  variance,  i.e., 

' 

(11.38) 

where  the  log-amplitude  variance  is  given  by 

(11.39) 

with 

k = 271  / A 

(11.40) 

and 

^ = z / R = Normalized  distance  along  path . 

(11.41) 

Combining  Eq.  (1 1.37)  with  the  integral  of  Eq.  (1 1.38)  we  can  determine  the  probability  that  the 
scintillation  can  produce  very  high  intensity  fluctuations.  This  is  shown  in  Figure  1 1-12.  It  is  easy 

to  see  that  near  saturated  scintillation  ( « 0.5)  intensity  spikes  greater  than  lOx  normal  can 

occur  as  much  as  1 % of  the  time.  Fades  in  intensity  are  even  deeper  and  more  frequent.  Turbulence 
fluctuations  should  be  expected  to  have  significant  impact  on  the  receiver  operating  characteristics 
of  systems  operating  in  the  visible  or  infrared.  Let  us  examine  turbulence  effects  on  coherent  laser 
radar  systems.  Except  for  the  turbulence  sensitivity,  laser  radars  will  behave  exactly  like  the 
microwave  radars  we  have  discussed  earlier. 

Let  us  consider  the  case  of  a non- fluctuating  target  (a  glint  target,  a specular  reflecting  target, 
or  a Swerling  0 target)  subject  to  turbulent  scintillation.  In  a radar,  the  scintillation  can  act  over  both 
the  transmitted  and  the  received  path.  The  intensity  statistics  are  then  given  by 

(11.42) 
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Figure  11-12.  Percentage  of  the  time  that  intensity  spikes  due  to  turbulent  scintillation 
exceed  given  limits  as  a function  of  turbulence  strength. 


INTENSITY  RELATIVE  TO  FREE  SPACE  (dB) 


with  given  by  Eq.  (11.38).  Inserting  these  results  into  Eq.  (11.23)  and  numerically  integrating  the 
result  yields  curves  of  the  form  of  Figure  11-13.  A relatively  complete  set  of  ROC  curves  may  be 
found  in  Reference  [4].  Comparing  Fig.  11-13  to  Figures  11-6  and  11-8  we  see  that  turbulence 
fluctuations  near  saturated  scintillation  requires  orders  of  magnitude  more  margin  for  adequate 
detection  than  does  Rayleigh  fading. 
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Figure  11-13.  Parametric  behavior  of  the  receiver  operating  characteristic  for  Pp  = 10  ’ 
showing  the  effect  of  increasing  turbulence  strength  on  normally  non-fluctuating  targets. 


If  we  wish  to  investigate  the  effects  of  turbulence  on  a Rayleigh- fading  target  things  get  a 
little  more  complicated.  First,  scintillation  which  occurs  on  the  transmitted  path  will  produce  an 
amplitude  modulation  at  the  target.  Since  Rayleigh-fading  targets  are  not  point  sources  (they  are 
by  definition  composed  of  a large  number  of  point  scatterers),  the  transmitter-to-target  path 
scintillation  will  be  partially  averaged  out  by  the  very  process  that  produces  the  Rayleigh  fading. 
This  aperture  averaging  alters  the  probability  distribution  of  fluctuations  to 

7,  (11.43) 

where  F is  a unit-mean,  exponentially  distributed  random  variable  with  distribution 
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Pv(V)  = 


-V 


(11.44) 


which  accounts  for  Rayleigh  fading  and  U is  a log-normal  variable  given  by 


Pu(U)  = 


U+a^ 

20-2 


(11.45) 


where  is  related  to  the  normal  log-amplitude  variance  by 


,4a" 


-1  = 4 


I6cri 


-1 


(11.46) 


with  i^being  an  aperture-averaging  factor  given  by 

[(t/VAZ)/[l  + (t/VAZ)]  d^>XLId 

^ |l/[l  + (4//lZ)]  d^<XLId 


(11.47) 


where  dj  is  the  target  dimension.  Figure  11-14  shows  the  effect  of  aperture  averaging  on  the  log- 
amplitude  variance  for  a range  of  aperture-averaging  factors. 

Substituting  these  results  into  Eq.  (11.23)  allows  numerical  computation  of  the  receiver 
operating  characteristics  of  a Rayleigh  fading  targets  subjected  to  additional  turbulent  scintillation. 
A sample  result  for  Pp  = lO'^  is  shown  in  Figure  11-15.  A complete  set  of  ROC  curves  for  a variety 
of  false  alarm  probabilities  and  turbulence  log-amplitude  variances  may  be  found  in  Ref  [4].  As 
with  non-fluctuating  targets  we  see  that  near  saturated  scintillation  enormous  CNR  margins  are 
required  to  give  adequate  detection  performance.  It  is  for  this  as  well  as  for  reasons  to  be  made  clear 
in  Chapter  14,  that  operation  anywhere  near  saturated  scintillation  is  to  be  avoided  if  possible.. 
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j 2 

Figure  11-14.  Aperture-averaged  log-amplitude  variance  & vs.  log-amplitude  variance  a ^ 
for  resolved  targets  with  various  transmitter  Fresnel  numbers  {d^/AL). 
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Figure  11-15.  Parametric  behavior  of  the  receiver  operating  characteristic  for  Pp  = 10  ’ 
showing  the  effect  of  increasing  turbulence  strength  on  normally  Rayleigh-fading  targets. 
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System  Level  Concerns  Involving  Detection 

The  probability  of  detection  and  probability  of  false  alarm  are  system  level  parameters.  That 
is,  they  are  usually  determined  by  mission  performance  considerations.  There  is  no  perfect  choice 
of  Pd  and  Pp.  Acceptable  values  are  determined  by  considering  the  penalties  that  each  choice 
imposes  on  the  system. 

A missed  detection  imposes  a penalty  on  the  system.  The  penalty  may  take  the  form  of  an 
increased  reaction  time  (because  additional  looks  at  the  scene  are  required  to  make  the  detection 
needed  to  start  any  reaction  process).  It  may  directly  result  in  increased  casualties.  It  might  require 
only  a single  additional  pulse  to  produce  a required  valid  range  estimate  (and  have  almost  no  impact 
on  the  timeline).  A missed  detection  might  also  result  in  total  annihilation  (e.g.,  if  the  missed 
detection  were  the  launch  of  a massive  nuclear  strike  or  the  approach  of  a single  biological  weapon 
containing  an  “Andromeda  Strain”  agent. [5]  Clearly  there  is  a huge  range  in  the  potential  penalties 
that  missed  detections  impose. 

Similarly,  there  is  a wide  range  in  the  penalties  imposed  by  a false  alarm.  The  penalty  might 
be  the  switching  of  a system  from  a search  mode  into  a tracking  mode  (with  consequent  loss  of 
search  capability  until  the  false  alarm  fails  to  recur  and  track  mode  is  canceled).  A false  alarm  might 
be  nothing  more  than  a minor  irritation  to  an  operator.  It  might  appear  as  “snow”  on  a display.  If 
the  operator  must  take  some  action,  then  the  false  alarm  might  become  a major  annoyance  if  it 
happens  too  frequently.  A false  alarm  might  lead  to  catastrophe  (e.g.,  if  the  false  alarm  were  the 
declaration  of  an  incoming  nuclear  strike  coupled  to  an  automatic  counterattack  response  - which 
would  of  course  trigger  the  real  launch  of  the  opposing  side’s  missiles). 

The  probabilities  of  false  alarm  and  detection  need  to  be  established  by  determining  the 
maximum  level  of  penalties  that  can  be  tolerated  and  determining  the  values  of  Pp  and  Pp  that 
correspond  to  those  maximum  tolerable  penalties.  This  will  result  in  the  lowest  acceptable  value 
of  Pq  and  the  highest  acceptable  value  of  Pp.  Combined  these  will  result  in  the  lowest  required 
carrier-to-noise  ratio.  Since  detection  probability  affects  required  CNR  more  strongly  than  false 
alarm  probability,  more  attention  should  be  paid  to  the  penalty  analysis  associated  with  missed 
detections  than  to  the  analysis  associated  with  false  alarms. 

As  an  example  of  how  this  might  be  done,  consider  a simple  laser  rangefinder.  The  device 
can  make  one  range  measurement  per  second  for  as  long  as  the  operator  pushes  the  fire  button.  Each 
range  measurement  has  a resolution  of  10  meters  and  the  range  is  sampled  continuously  over  the 
span  of  0 m to  1 0,000  m.  The  measured  range  is  displayed  visually  to  the  operator.  Over  the  course 
of  a year,  a typical  operator  of  this  device  will  only  use  the  rangefinder  about  100  times.  We  need 
an  estimate  of  the  required  and  Pp  to  include  in  a system  specification. 

What  are  the  penalties  associated  with  a missed  detection?  Basically,  if  the  operator  fails 
to  see  a valid  range  appear  on  the  display,  he  pushes  the  fire  button,  he  must  push  the  button  again. 
Failure  to  get  a valid  range  (i.e.,  the  system  misses  a detection)  will  cost  the  operator  little  more  than 
one  second  of  time.  In  one  second,  typical  objects  of  interest  will  have  moved  no  more  than  a few 
tens  of  meters.  This  is  insignificant.  On  the  other  hand  if  the  operator  misses  ten  measurements  in 


360 


a row,  a long  time  (10  seconds)  is  wasted,  and  the  target  situation  may  have  changed  unacceptably. 
A few  missed  detections  (perhaps  as  many  as  three)  in  a row  is  acceptable;  ten  is  not.  If  an 
unacceptable  number  of  missed  detections  happens  only  once  in  the  career  of  an  operator  he  will 
seldom  mind  - the  unacceptable  performance  is  not  only  rare,  but  it  is  much  more  likely  to  occur 
during  training  than  during  combat.  If  we  assume  the  operator  has  an  average  career  of  ten  years 
and  uses  the  rangefinder  100  times  per  year,  we  find  that  one  “run”  of  missed  detections  is 
acceptable  every  1000  uses.  If  we  say  an  unacceptable  run  is  three  missed  pulses  then 

P(3  misses)  = 1 - (l  - < 0.001  =>  « 0.90 

Thus  we  obtain  a result  that  states  that  = 0.90  is  almost  certainly  acceptable.  It  is  not  t^  right 
answer,  but  it  is  not  the  only  right  answer,  A second  individual  might  have  changed  the  initial 
assumptions  slightly  and  obtained  = 0.80  or  P^  = 0.95. 

What  are  the  penalties  associated  with  a false  alarm?  A false  alarm  will  manifest  itself  as 
an  incorrect  range  value  selected  randomly  from  all  of  the  possible  range  values.  The  user  almost 
always  has  a visible  check  on  the  rough  range  to  the  target.  From  size  alone  he  can  estimate  that  a 
tank  is  closer  to  1 km  range  than  to  10  km.  Thus  as  many  as  80%  of  all  false  alarms  will  be 
immediately  recognized  as  being  errors.  The  remaining  20%  are  too  close  (within  10%)  to  the  true 
value  to  be  recognized  as  erroneous.  If  the  rangefinder  is  being  used  to  direct  artillery,  indirect  fire 
weapons,  then  the  penalty  for  error  might  be  missing  the  target  and  wasting  a few  artillery  shells. 
If  the  rangefinder  is  being  used  to  determine  a fire  control  solution  for  a direct  fire  weapon,  there 
is  a good  chance  that  the  range  error  would  cause  a missed  target.  This  could  conceivably  result  in 
the  target  being  able  to  return  fire  and  kill  the  person  with  the  rangefinder.  This  is  a penalty  that  the 
observer  would  find  unacceptable.  Thus  it  is  desirable  that  the  probability  of  a false  alarm  occurring 
be  no  worse  than  one  “undetectable”  false  alarm  in  the  career  of  a user.  That  is,  no  more  than  1 false 
alarm  in  200  range  measurements.  Since  there  are  1000  possibilities  for  a false  alarm  to  occur  on 
every  laser  pulse  (10,000  range  span  divided  by  10  m resolution),  the  resultant  false  alarm 
probability  is 


Pp  = 1/(200  X 1000)  = 5x10-^ 

Thus,  a reasonable  set  of  requirements  is  P^  = 0.90  and  Pp  = 5 x KP’.  However,  another  systems 
engineer  might  have  come  with  different  numbers.  Correctness  is  entirely  in  the  eye  of  the  beholder. 
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Clutter  and  Clutter  Rejection 

Most  sensor  systems  are  sensitive  to  signals  of  one  form  or  another  that  are  produced  by 
objects  in  the  scene  that  are  neither  targets  nor  other  objects  of  interest.  Because  they  are  real 
objects,  they  produce  real  threshold  crossings  that  are  not  false  alarms.  They  will  recur  in  every 
scan.  These  signals  often  have  properties  that  make  them  difficult  to  distinguish  from  the  objects 
of  interest.  Collectively,  these  signals  are  called  clutter,  due  to  the  visual  effect  they  have  on 
detecting  targets  in  a radar  display. 

Clutter  signals  can  be  produced  by  almost  real  object  that  is  of  secondary  importance  or  less. 
Common  sources  of  clutter  include  emission  or  reflection  of  radiation  from: 

* Terrain  features 

* Birds,  animals,  or  insects 

* Clouds 

* Precipitation 

* Moving  foliage  or  water  waves 

* Junked  or  destroyed  vehicles  and  other  debris 

Some  of  these  objects  are  far  more  important  is  some  classes  of  sensors  than  in  others,  and  vice 
versa.  For  example,  birds  are  a serious  form  of  clutter  in  infrared  search  and  track  systems.  They 
look  and  act  just  like  aircraft  targets.  Water  waves  are  a serious  clutter  source  in  marine  radar 
systems. 

Clutter  signals  exceed  the  threshold  of  detection  of  a system  and  produce  “clutter  false 
alarms”.  Real  false  alarms  are  due  to  noise  and  are  random  occurrences.  They  do  not  recur  in  look 
after  look.  Clutter  false  alarms  are  due  to  real  objects  and  do  persist  over  many  observations.  The 
primary  deleterious  effects  of  clutter  false  alarms  are  that  they  attract  the  operator’s  attention  and 
may  cause  him  to  fail  to  detect  real  target  objects  the  sensor  detected.  They  make  tracking  more 
difficult  by  making  the  scene-to-scene  association  of  targets  more  difficult  (see  Chapter  13).  Their 
similarity  to  real  targets  often  precludes  automation  of  the  detection  process.  In  general,  it  is  entirely 
possible  for  clutter  to  degrade  system  performance  to  unacceptable  levels. 

Sensors  with  clutter  limitations  employ  a variety  of  techniques  collectively  known  as  clutter 
rejection  to  ameliorate  the  effects  of  clutter.  Among  the  techniques  that  might  prove  useful  in  any 
given  situation  include: 

* Raising  the  detection  threshold  (with  consequent  loss  of  real  detection  probability) 

* Constant  false  alarm  rate  (CFAR)  processing  (varying  the  detection  threshold  adaptively 
to  maintain  a constant  rate  of  threshold  crossings  - this  also  degrades  detection 
probability) 

* Moving  target  indication  (discrimination  based  on  target  motion) 

* Spatial  fdtering  of  image  data  (discrimination  based  on  target  shape  and  size) 

* Sensitivity  time  control  (emphasizes  distant  returns  versus  close-in  returns) 

* Optimized  waveforms 

No  clutter  rejection  technique  is  perfect.  Most  result  in  some  loss  of  detection,  but  less  than  occurs 
if  clutter  is  present.  Clutter  rejection  will  be  revisited  in  the  individual  sensor  discussions. 
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Problems 


11-1.  How  does  signal-to-noise  ratio  differ  from  carrier-to-noise  ratio? 

11-2.  Write  the  integral  expressions  for  false  alarm  probability  and  detection  probability? 

11-3.  What  is  a receiver  operating  characteristic  and  how  is  it  utilized? 

11-4.  Increasing  the  threshold  voltage  in  a detection  system  has  what  effects  on  detection 
probability  and  false  alarm  probability? 

11-5.  A system  specification  requires  certain  detection  and  false  alarm  probabilities.  The  initial 
hardware  concept  (as  sized)  does  not  have  adequate  CNR  to  produce  those  detection  and 
false  alarm  probabilities  with  single  pulses.  Without  abandoning  the  system  concept 
entirely,  what  options  are  available  to  you  to  provide  the  required  performance?  Hint:  there 
are  at  least  three  totally  unrelated  approaches  that  may  be  taken. 

1 1 -6.  In  the  system  of  problem  11-5,  which  option  (or  options)  is  likely  to  yield  the  best  (accept- 
able) solution  in  the  following  cases: 

a.  The  CNR  shortfall  is  3 dB? 

b.  The  CNR  shortfall  is  1 0 dB? 

c.  The  CNR  shortfall  is  30  dB? 

11-7.  A radar  was  designed  to  work  with  = 0.99  and  Pp  = 1 0 * against  a target  with  Swerling  2 
statistics.  Later  intelligence  exploitation  of  the  target  yields  the  fact  that  the  target  statistics 
are  in  fact  Swerling  4 even  though  the  estimated  cross  section  was  correct..  What  implica- 
tions does  this  discovery  have  on  required  CNR?  What  is  the  range  performance  implication 
of  this  discovery  assuming  the  target  remains  the  same?  Assume  that  for  this  radar 

CNR  X 1/R^ 

11-8.  A ground  surveillance  radar  for  armored  vehicles  uses  1 00-pulse  integration  to  provide  22.3 
dB  integration  improvement.  Estimate  the  miss  probability  of  this  radar.  If  CNR  = 2.1  dB 
yields  adequate  performance  after  integration  improvement,  what  is  the  approximate  false 
alarm  probability?  If  the  mean  time  between  false  alarms  is  one  day,  what  is  the  bandwidth 
of  the  radar  electronics?  If  we  lower  the  average  noise  by  10%,  what  will  be  the  new  mean 
time  between  false  alarms  (for  the  same  bandwidth  as  above)? 

11-9.  You  are  designing  a mid-range  air  search  track- while-scan  (TWS)  radar  for  littoral 
operations.  Intelligence  reports  that  a traditionally  belligerent  nation  has  developed  an 
effective  attack  technique  known  only  as  "flock  of  seagulls".  This  technique  uses  a medium- 
bomber  aircraft  that  launches  a mass  of  winged,  unguided  rockets  that  form  a swarm  around 
it  as  it  enters  the  SM-2  engagement  zone  of  our  ships.  Intelligence  suggests  that  the  purpose 
is  to  induce  cross  section  variations  that  are  equivalent  to  those  of  one  large  steady  return 
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and  numerous,  small  random  reflectors.  Given  a required  Pp  = 0.9  and  Pp  = 10  ®,  calculate 
the  CNR  required  for  the  TWS  radar  to  effectively  detect  the  "flock  of  seagulls".  If  9 pulses 
is  the  maximum  number  available  for  pulse  integration,  recalculate  the  CNR  required  if 
pulse  integration  is  used.  The  hostile  bomber  alone  obeys  Swerling  2 statistics.  If  "flock  of 
seagulls"  is  truly  designed  to  adversely  P^  and  Pp,  did  the  hostile  nation  make  a wise  choice 
in  deploying  "flock  of  seagulls"? 

11-10.  What  CNR  is  required  by  a radar  to  provide  a 90%  detection  probability  with  a 10  ® false 
alarm  probability  for  10  pulse  optimum  integration  assuming  the  target  obeys  Swerling  II 
statistics?  If  the  radar  bandwidth  is  10  MHZ,  what  is  the  mean  time  between  false  alarms? 

11-11.  Given  a sensor  noise  probability  density PfjiV)  and  a signal  fluctuation  probability  distribu- 
tion how  would  you  calculate  a set  of  receiver  operating  characteristic  curves  for  that 
sensor. 

11-12.  What  is  M-of-N  detection?  Why  would  a system  designer  consider  using  it?  What  penalty 
relative  to  a perfect  system  would  a system  using  M-of-N  detection  suffer? 

11-13.  A laser  radar  must  have  a detection  probability  of  0.999  against  a Swerling  II  (speckle) 
target.  The  radar  has  a 20  dB  margin  to  accommodate  a certain  amount  of  turbulent 
scintillation  degradation.  What  is  the  maximum  turbulence  log-amplitude  variance  that  will 
not  cause  an  unacceptable  degradation? 

11-14.  Determine  reasonable  values  of  detection  probability  and  false  alarm  probability  for  a simple 
laser  rangefinder  used  as  part  of  a tank  gun  fire  control  system.  The  rangefinder  is  capable 
of  making  1 Hz  range  measurements  indefinitely. 
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CHAPTER  12 


ESTIMATION 


Characteristics  of  Measurements 

Measurements  with  sensors  are  made  with  a purpose.  One  of  these  purposes  is  to  estimate 
the  value  of  some  parameter  of  interest.  For  example,  we  may  wish  to  estimate  the  range  to  a 
waypoint  in  a navigation  application  or  the  angular  size  of  a target  in  a target  recognition  applica- 
tion. There  is  a science  of  measurement.  Investigators  have  defined  a number  of  characteristics  of 
measurements  with  which  a systems  engineer  must  be  familiar.  In  this  section  we  will  look  at  some 
of  those  definitions. 

Accuracy  of  any  measurement  is  clearly  of  interest.  The  IEEE  Dictionary  [I]  defines 
accuracy  as: 

“the  quality  of  freedom  from  error  or  mistake.  ” 

Error  is: 

“any  discrepancy  between  a computed,  observed,  or  measured  quantity  and  the  true, 
specified,  or  theoretically  correct  value  or  condition.  ”[1] 

It  is  obvious  that  one  wants  measurements  to  be  as  accurate  as  possible.  This  requires  understanding 
and  controlling  the  sources  of  error.  There  are  different  kinds  of  errors.  A random  error  is: 

“a  component  of  error  whose  magnitude  and  direction  vary  in  a random  manner  in  a 
sequence  of  measurements  made  under  nominally  identical  conditions.  ”[1] 

Random  errors  may  arise  from  the  effects  of  sensor  noise,  from  imperfect  implementation  of  sensor 
components,  or  from  temporal  variation  of  systematic  errors.  A systematic  error  is: 

“the  inherent  bias  (offset)  of  a measurement  process  or  one  of  its  components  ”[I] 
while  a bias  error  is  defined  as: 

“a  systematic  error  whether  due  to  equipment  or  ambient  conditions.  [1] 

Bias  errors  can  result  from  imperfect  implementation  of  sensor  components,  imperfect  alignment 
or  calibration  of  sensor  components,  or  use  of  an  imperfect  “estimator”.  An  estimator  is  a 
processing  approach  to  making  an  estimate  of  some  value  of  a signal.  For  example,  an  estimator  of 
time  of  arrival  of  a pulse  is  to  set  a threshold,  measure  the  times  at  which  the  pulse  crosses  the 
threshold  (both  upward  and  downward  crossings),  add  the  results  and  divide  by  2.  Some  estimators 
are  biased.  That  is,  they  respond  differently  to  upward  noise  fluctuation  than  to  downward  noise 
fluctuations.  This  leads  to  a biased  estimate  with  the  magnitude  of  the  bias  being  a function  of  the 
relative  noise  level. 

Related  to  accuracy,  but  not  equivalent  to  it,  are  the  concepts  of  precision,  repeatability,  and 
uncertainty  of  measurements.  Precision  has  two  definitions.  One  is: 

“the  degree  of  exactness  or  discrimination  with  which  quantity  is  stated.  ”[1] 

The  second  is: 

“the  quality  of  coherence  or  repeatability  of  measurement  data.  ”[1] 
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Repeatability  is: 

“the  closeness  of  agreement  among  repeated  measurements  of  the  same  variable  under  the 
same  conditions. 

Uncertainty  is: 

“the  estimated  amount  by  which  the  observed  or  calculated  value  of  a quantity  may  depart 
from  the  true  value.  "[1] 

The  uncertainty  is  usually  given  as  the  standard  deviation  or  the  probable  error  of  the  random 
distribution  associated  with  the  observation  or  calculation. 

The  relationship  between  accuracy  and  precision  is  illustrated  in  Figure  12-1  using  the 
context  of  bullets  fired  at  a target.  The  upper  left  target  shows  a situation  that  is  neither  accurate  nor 
precise.  There  is  a substantial  bias  and  there  is  a large  uncertainty.  In  the  target  at  the  upper  right, 
the  bias  has  been  completely  eliminated.  This  might  seem  accurate,  but  there  is  still  a large 
uncertainty,  so  none  of  the  individual  measurements  is  likely  to  come  close  to  the  true  value 
(accuracy  is  the  freedom  from  error  - of  any  kind).  Thus,  the  situation  is  still  neither  accurate  nor 
precise.  The  situation  at  lower  left  is  precise  but  still  not  accurate.  The  uncertainty  has  been  largely 
eliminated,  but  a large  bias  remains.  Remember  that  accuracy  is  the  freedom  from  all  error.  You 
cannot  be  accurate  is  the  result  is  biased  or  has  a large  uncertainty.  The  target  at  lower  right  shows 
a situation  that  is  reasonably  accurate  and  precise.  Both  the  bias  and  the  uncertainty  are  reasonably 
limited. 


Figure  12-1.  Relationship  between  accuracy  and  precision. 
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Sensor  Resolution 


Another  property  of  measurements  that  deserves  considerable  study  is  the  resolution  of  the 
sensor  that  obtained  them.  The  dictionary  defines  resolution  as: 

“the  smallest  change  in  the  pulse  characteristic,  property,  or  attribute  being  measured 
which  can  be  unambiguously  be  discerned  or  detected  in  a pulse  measurement  process.  ”[1] 
However,  the  author  prefers  to  use  his  own  definition: 

“the  smallest  separation  in  the  variable  space  between  two  identical  point  objects  at  which 
the  presence  of  two  separate  signal  contributions  can  be  discerned  or  detected.  ” 
Resolution  should  be  distinguished  from  a quantity  called  resolvability.  Resolvability  is  defined 
by  the  author  as: 

“the  ability  of  a system  to  detect  or  discern  the  presence  of  individual  signal  contributions 
of two  objects  of  specified  (and  possibly  unequal)  physical  extent  and  specified  (and possibly 
unequal)  signal  strength  separated  in  the  variable  space  by  a specified  amount.  ’’ 

It  is  quite  possible  that  resolvability  is  the  desired  system  performance  value,  but  resolution  is 
mistakenly  specified  instead.  For  example,  if  the  problem  is  the  detection  of  a small  boat  alongside 
a large  ship,  then  specifying  resolution  may  not  guarantee  that  detection  is  possible.  Specifying 
resolvability  using  the  relative  signal  strengths  and  relative  physical  sizes  would  guarantee  that 
performance.  Resolution  is  relatively  easy  to  determine.  Resolvability  involves  multiple  parameters 
(not  just  separation)  and  is  considerably  more  difficult  but  not  impossible  to  evaluate. 

Angular  resolution  is  qualitatively  determined  by  moving  two  point  source  targets  apart  until 
the  superposition  of  the  two  point  source  response  functions  exhibits  a discernible  dip.  For  a sensor 
with  point  source  response  G( 9)  and  two  identical  point  sources  with  angular  separation  9,  the  ratio 
of  the  intensity  of  the  sum  response  mid-way  between  the  two  sources  to  the  sum  response  at  the 
position  of  either  source  defines  a resolution  function  R(9) 


R(d) 


MIDPOINT 


{0) 


^ SOURCE  POSITION 


{0) 


iG(en) 

G(0)  + G(<9) 


(12.1) 


The  angular  separation  producing  a specified  value  (determined  by  the  resolution  criterion  used  - 
see  below)  of  R < 1 is  the  sensor's  angular  resolution.  R( 9)  can  be  considered  to  be  a “resolution 
function”.  It  should  be  noted  that  this  analysis  is  perfectly  general  and  time  t could  easily  be 
substituted  for  (although  using  units  of  AJD  would  no  longer  be  appropriate). 

For  uniformly  illuminated  apertures  (e.g.,  as  found  in  most  kinds  of  passive  sensor),  the  point 
source  response  function  is  an  Airy  function  [2] 


Cairy{^) 
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(12.2) 


where  Jf  9)  is  the  Bessel  function  of  the  first  kind  of  order  1 . For  Airy  function  intensity  distribu- 


369 


tions,  Rayleigh  assumed  that  two  sources  would  be  resolved  when  the  peak  of  the  first  Airy  function 
coincided  with  the  first  zero  of  the  second  Airy  function.  At  this  point,  for  aperture  diameter  Z)  and 
wavelength  A,  the  angular  separation  is 

0=l.220A/ D = a (12.3) 

where  a is  the  angular  resolution  and  the  resolution  function  has  a value  of 

R^jj,y{l.220A/  D)^0.735  (12.4) 

This  Rayleigh  resolution  criterion  corresponds  to  a 26.5%  intensity  dip.  It  is  commonly  utilized 
when  describing  optical  sensors. 

Sparrow  assumed  that  two  sources  would  be  barely  resolved  when  the  sum  curve  was  flat 
at  the  midpoint.  In  this  case,  the  angular  separation  is 

0^1.007  A/D  (12.5) 

and  the  resolution  function  has  a value  of 

R^jj^y{l.007  A / D)  ^ 1.000  (12.6) 

The  Sparrow  resolution  criterion  corresponds  to  a 0%  intensity  dip.  It  is  commonly  used  when 
describing  microwave  sensor  systems  and  is  often  approximated  as  A/D. 

Laser  beams  typically  possess  Gaussian  intensity  distributions.  A sensor  might  have  a 
simple  Gaussian  point  source  response: 

(12.7) 

where  = TJtvwq  is  the  laser  beam  divergence.  A convenient  resolution  criterion  when  dealing  with 
Gaussian  distributions  is  to  assume  that  two  sources  are  resolved  when  the  peaks  of  the  two 
distributions  are  separated  by  an  amount  equal  to  the  full-width-at-half-maximum-intensity  of  one 
of  the  distributions.  It  is  easily  shown  that  this  produces 

0.5(2  In  2)'^"  ^0  = 0.588705^0  (12-8) 

and  a value 

^GAUSSIAN  FWHm{.^^  ~ (12.9) 

The  Gaussian  FWHM  resolution  criterion  produces  a 5.9%  intensity  dip.  Note:  when  comparing 
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resolutions  between  different  kinds  of  sensors,  it  is  important  to  use  the  same  resolution  criterion 
for  all. 


In  any  radar  the  point  target  response  function  must  contain  contributions  from  the 
transmitter  beam  profile  and  the  receiver  response  profile.  Since  the  total  response  profile  is  the 
product  of  two  profiles,  the  total  response  profile  will  be  narrower  than  either  the  transmitter  profile 
or  the  receiver  profile.  Thus,  for  a given  aperture,  an  active  sensor  will  have  higher  resolution  than 
the  comparable  passive  sensor.  Few  people,  including  many  sensor  designers,  are  aware  of  this  fact. 

In  a laser  radar,  both  transmit  and  receive  profiles  may  be  approximated  by  Gaussian 
profiles.  [3,4]  The  resulting  response  profile  is  given  by 


^squared-gaussianK^)^^  ^ ^ 


(12.10) 


where  Og  = AJttwo,  and  Wg  is  the  spot  size  at  the  Gaussian  beam  waist.  The  spot  size  is  usually 
adjusted  such  that  the  aperture  diameter  is  roughly  2 V2  (2  to  3)  times  the  spot  size.  With  this  choice, 
95%  of  the  beam  energy  passes  through  the  aperture,  side  lobes  are  small,  and  the  resolution  is  kept 
high.  Using  the  new  expressions  for  G( 6)  and  Wg,  the  Gaussian  FWHM  criterion  resolution  of  the 
laser  radar  is  found  to  be 


6>  = 0.5(ln2y'"6>o  = 0.416286>o  . 


(12.11) 


The  Gaussian  FWHM  criterion  still  corresponds  to  a 5.9%  dip.  In  terms  of  the  aperture-matched 
spot  size  we  can  define  the  angular  resolution  of  a laser  radar  as 


(In 2)'^"  A _ (Min 2) 


a = 


1/2 


^ = 0.26501  VM  — 


7T 


7T 


D 


D 


(12.12) 


where  M is  an  aperture  matching  criterion  (the  Gaussian  spot  size  Wg  must  take  some  fixed 
relationship  to  the  aperture  diameter/).  We  have  defined  the  matching  as  Z)  = (M)‘^^Wg.  Mtypically 
takes  on  a value  between  4 and  9.  Using  a conservative  value  of  M = 5,  we  calculate 


a « 0.74956— 
D 


(12.13) 


The  Rayleigh  criterion  for  the  squared-Gaussian  profile  becomes 

a = 0.892  2 /D 


(12.14) 


while  the  Sparrow  criterion  for  the  squared-Gaussian  profile  becomes 
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a = 0.703;i/Z). 


(12.15) 


It  is  relatively  easy  to  evaluate  the  resolution  function  for  a simple  Gaussian  profile.  If  we 
assume  the  same  aperture  matching  condition  as  above  {D  = then 


a = 0.994  A,  ! D Sparrow 


(12.16) 


a = 1.060  A / D Gaussian  FWHM 


a — 1.261/1  / D Rayleigh 


(12.17) 

(12.18) 


It  is  equally  easy  to  evaluate  the  resolution  function  for  a squared- Airy  function 
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(12.19) 


to  yield 


a = 0.707/1/1) 

Sparrow 

(12.20) 

a^O.lA^A!  D 

Gaussian  FWHM 

(12.21) 

ct  = 0.878/1/7) 

Rayleigh 

(12.22) 

The  preceding  results  are  illustrated 

in  Figure  12-2. 

The  resolution  of  a sensor  is  often  estimated  from  one  of  the  equations  above.  However, 
there  is  little  standardization  in  the  definition  of  resolution.  That,  in  fact,  is  why  the  author 
introduced  a well-defined  mathematical  resolution  function.  When  dealing  with  sensor  resolution, 
one  should  always  attempt  to  ascertain  what  type  of  point  source  response  has  been  assumed  (Airy, 
squared-Gaussian,  or  something  else)  and  what  resolution  criterion  (Sparrow,  Rayleigh,  Gaussian 
FWHM,  or  something  else).  When  writing  a specification  on  a system  in  which  resolution  is 
important,  the  specification  should  explicitly  state  the  assumptions  to  be  used.  When  examining 
vendor  offerings  one  should  bear  in  mind  that  vendors  will  almost  always  attempt  to  present  their 
results  in  the  best  light.  If  you  ask  for  the  resolution,  they  will  say  it  is  x milliradians.  They  may 
be  using  the  Sparrow  criterion  (an  optimistic  criterion  that  leads  to  small  values  of  resolution)  while 
you  may  be  assuming  the  Rayleigh  criterion  (a  very  pessimistic  criterion).  The  20%  difference 
between  these  criteria  could  easily  result  in  inadequate  system  performance.  If  you  don’t  require 
disclosure  of  the  details,  you  can  be  very  disappointed  with  what  you  actually  get. 
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Figure  12-2.  Resolution  function  dependence  on  angular  separation. 


In  estimating  the  resolution  of  a sensor  you  begin  with  the  point  source  response  function. 
In  general  this  is  a complicated  function  with  no  good  analytical  fit.  However,  from  Figure  12-2, 
we  see  that  there  is  little  real  difference  (a  few  percent  at  most)  between  the  results  for  two  radically 
different  active  sensor  response  functions  (squared- Airy  vs.  squared-Gaussian).  There  is  very  little 
difference  between  the  curves  for  the  two  passive  sensor  response  functions  (Gaussian  vs.  Airy). 
There  is  considerable  difference  between  the  active  and  passive  sensor  response  functions.  [4]  Thus, 
if  a detailed  response  function  is  not  available  determine  first  whether  the  sensor  is  active  or  passive 
and  then  estimate  whether  the  result  is  more  likely  to  be  approximated  by  Airy  or  Gaussian  behavior. 
The  resulting  choice  will  give  an  estimate  of  limiting  resolution  that  will  be  reasonably  accurate. 
Next,  a resolution  criterion  should  be  selected.  Your  own  leanings  towards  pessimism  or  optimism 
should  guide  this  choice.  However,  this  should  be  tempered  by  the  fact  that  picking  a criterion  will 
drive  the  physical  size  of  the  aperture  of  the  sensor  required  to  meet  it.  The  20%  difference  between 
Sparrow  and  Rayleigh  criteria  means  a 20%  difference  in  aperture  size.  This  has  potentially  major 
impacts  on  size,  weight,  and  cost  of  the  total  system.  The  author  usually  prefers  the  Gaussian 
FWHM  criterion.  It  is  reasonably  optimistic  but  not  overly  so. 

In  the  preceding  paragraph  the  author  referred  to  a limiting  resolution  of  a system.  The 
physics  of  diffraction  imply  that  resolution  cannot  exceed  x A/D  where  x is  of  order  unity.  However, 
other  design  considerations  can  lead  to  actual  resolutions  that  are  much  larger  (remember,  when 
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dealing  with  resolution,  large  is  bad)  than  the  limiting  values.  For  example,  any  defocus  of  an 
optical  system,  any  pointing  jitter,  any  turbulence  beam  spreading,  any  narrowband  filtering  of  a 
detector  output,  any  spatial  quantization  imposed  by  a detector  array,  will  result  in  image  blurring 
and  poorer  resolution.  In  later  chapters  we  will  show  how  to  explicitly  handle  these  effects. 

However,  in  real  sensor  systems,  one  need  not  calculate  the  resolution,  one  can  measure  it. 
This  is  done  by  viewing  a resolution  chart.  A resolution  chart  is  a graphic  element  that  has  regions 
of  light  and  dark  of  varying  sizes,  shapes,  and  separations.  It  is  viewed  by  the  sensor  under  test  and 
the  smallest  scale  that  still  yields  acceptable  “resolution”  based  on  a pre-selected  resolution  criterion 
is  the  effective  resolution  of  the  sensor.  There  are  many  standardized  resolution  charts  that  are 
available  commercially  for  sensor  testing.  It  is  also  possible  to  generate  your  own.  Figure  12-3 
shows  a resolution  chart  the  author  generated  for  an  experiment.  It  is  organized  based  on  four-bar 
patterns  of  effectively  unit  contrast  such  that  horizontal  and  vertical  resolutions  can  be  testing 
separately.  When  the  full-sized  chart  was  placed  at  1 meter  from  the  subject,  the  resolution  could 
be  read  directly  in  mm/mrad  from  the  numbers  beside  each  pattern.  Please  notice  that  after  printing, 
scanning,  reducing,  and  printing  in  this  copy  of  the  book,  the  resolution  had  degraded  until  the 
smallest  patterns  are  no  longer  resolvable  as  four  bars.  The  smallest  pattern  that  can  be  discerned 
as  four  bars  defines  the  effective  resolution. 


Figure  12-3.  A simple  four-bar  resolution  chart  generated  by  the  author. 
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Having  discussed  both  precision  and  resolution,  and  noting  that  many  individuals  use  resolution 
when  they  mean  precision,  it  is  worthwhile  to  directly  compare  the  two  quantities.  This  is  done 
graphically  in  Figure  12-4.  Remember  that  resolution  involves  two  objects.  When  the  objects  are 
separated  by  sufficient  distance,  their  superimposed  responses  will  exhibit  a significant  dip.  For 
Gaussian  responses,  a separation  equal  to  the  full- width-at-half-maximum  gives  a clearly  discernible 
dip.  Precision  involves  only  one  signal  and  describes  our  ability  to  precise  locate  the  response  in 
time  due  to  noise  superimposed  on  the  signal.  The  figure  shows  a Gaussian  signal  with  added 
sinusoidal  noise.  With  the  chosen  relative  phase  for  the  noise  note  that  the  centroid  of  the  sum  is 
displaced  by  an  amount  <5T  from  the  true  centroid.  If  the  noise  were  to  be  selected  to  have  a phase 
180  degrees  different  from  that  depicted,  the  sum  would  jump  to  the  other  side  of  the  true  value. 
When  all  possible  phases  of  the  noise  are  averaged,  a root-mean-square  deviation  that  is  not  zero 
will  exist.  This  is  the  precision  of  the  measurement  - the  fundamental  limit  on  the  accuracy  with 
which  any  set  of  measurements  can  be  made.  There  is  no  fundamental  physical  connection  between 
resolution  and  precision.  However,  as  we  shall  see  later,  both  resolution  and  precision  depend  on 
the  characteristic  time  scale  of  the  signal  and  will  have  a definite  mathematical  relation  to  each 
other. 


Figure  12-4.  Graphical  representation  of  the  difference  between  resolution  and  precision. 
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Parameter  Estimation 


Estimating  the  value  of  a target  parameter  (such  as  range  or  velocity)  from  a signal  corrupted 
by  noise  is  the  second  “elemental”  sensor  function  that  finds  frequent  utilization  in  higher-level 
sensor  functions.  In  most  cases,  parameter  estimation  devolves  to  the  process  of  estimating  the 
location  (value  in  space  or  time)  of  the  center  of  an  amplitude  distribution  related  in  some  well- 
defined  way  to  the  target  parameter  of  interest.  Often  the  task  is  equivalent  to  estimating  the  time 
at  which  the  center  of  a pulse  occurs  (most  estimation  problems  can  be  transformed  into  such  a pulse 
timing  problem). 

The  simplest  estimator  of  pulse  center  is  to  measure  the  time  at  which  the  peak  of  the  pulse 
occurs  and  declare  this  time  to  represent  the  pulse  center.  Unfortunately,  this  estimator  is  prone  to 
producing  significant  errors  at  low  carrier-to-noise  ratios  and  may  occasionally  generate  anomalous 
estimates  (large  errors  due  to  random  noise  spikes)  even  at  moderate  carrier-to-noise  ratios. 

A more  accurate  estimator  is  determination  of  the  centroid  of  the  pulse.  The  centroid  could 
be  calculated  digitally  after  sampling  the  pulse  at  high  speed,  or  it  could  be  determined  in  an  analog 
fashion  by  dividing  the  pulse  into  two  equal  copies,  delaying  one  copy  with  respect  to  the  other 
integrating  the  undelayed  copy,  setting  a threshold  at  half  the  amplitude  of  the  integrated  pulse, 
integrating  the  delayed  copy  and  determining  the  time  at  which  the  integrated  delayed  waveform 
crosses  the  threshold. 

A simple  yet  surprisingly  accurate  estimator  of  the  pulse  center  is  to  threshold  the  pulse, 
determine  the  times  at  which  the  rising  and  falling  threshold  crossings  occur,  and  declaring  the  time 
midway  between  the  threshold  crossings  as  the  pulse  center.  This  is  the  estimator  we  will 
investigate  in  more  detail. 

Other  estimators  can  be  constructed.  However,  all  of  them  will  produce  estimates  that  are 
affected  by  noise  on  the  pulse.  Barring  the  production  of  anomalies  by  some  estimators,  the  primary 
effect  of  noise  on  all  estimators  is  to  reduce  the  precision  of  their  estimates.  The  behavior  of  the 
CNR-dependence  of  the  precision  will  be  comparable  to  that  of  the  rising- falling  threshold  crossing 
estimator  to  be  analyzed  below. 

Noise  affects  the  ability  to  estimate  the  time  of  a threshold  crossing.  Consider  a voltage 
pulse  of  rise  time  rand  peak  amplitude  A corrupted  by  noise  of  root-mean-square  (rms)  voltage  Vg. 
Consider  first  the  rising  portion  of  the  pulse  as  illustrated  in  Figure  12-5.  Please  note  that  the  rise 
time  (from  minimum  to  maximum)  is  essentially  equal  to  the  pulse  duration  if  rise  and  fall  times  are 
equal.  If  the  bandwidth  of  the  noise  B is  roughly  equal  to  the  bandwidth  of  the  signal  (=  1/t)  then 
noise  can  be  considered  to  roughly  constant  over  the  first  half  of  the  signal  and  roughly  constant  (but 
different)  over  the  second  half  This  is  an  acceptable  assumption  because  the  random  phasing  of 
changes  in  noise  amplitude  is  accommodated  by  our  use  of  only  rms  values  of  noise  in  the  equations. 
The  nominally  constant  noise  voltage  will  add  to  the  leading  half  of  the  signal  voltage.  If  an 
arbitrary  threshold  has  been  set  somewhere  between  the  minimum  and  maximum  signal  values,  then 
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Figure  12-5.  Geometric  model  for  analysis  of  precision  in  estimation  processes. 
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the  noise  causes  a time  shift  in  the  threshold  crossing.  Since  the  noise  voltage  has  been  assumed  to 
be  an  rms  value,  the  change  in  threshold  crossing  time  will  also  be  an  rms  quantity.  From  the 
geometry  of  Figure  12-5  we  can  immediately  calculate  the  rms  time  shift  to  be 

tV 
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Now  consider  the  falling  threshold  crossing.  Obviously,  the  mathematical  result  for  the  rms  time 
shift  will  be  the  same.  However,  because  we  have  assumed  a particular  phase  for  the  noise  that  lead 
to  a constant  value  over  the  rising  half  of  the  pulse,  the  time  shift  for  the  falling  half  of  the  pulse  will 
be  uncorrelated  with  the  time  shift  for  the  rising  half  The  rms  error  in  estimating  the  center  of  the 
pulse  by  adding  the  times  and  dividing  by  two  will  therefore  depend  on  the  root-sum-square  (rss) 
of  the  two  threshold  crossing  time  shifts. 
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If  we  recall  the  definition  for  the  carrier-to-noise  ratio  of  a simple  radar  (from  the  previous  chapter) 


CNR  = A^  / 


(12.25) 


then  the  rms  centroid  error  can  be  written  as 
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(12.26) 


~ 4icm  ^ B^ICNR  ~ ^ICNR  ' 

The  last  equality  in  Eq.  (12.26)  uses  the  basic  equivalence  of  the  pulse  duration  rand  the  resolution 
AT  using  a small-dip  resolution  criterion  such  as  the  Gaussian  FWHM  criterion.  There  is  no  other 
fundamental  connection  between  resolution  and  precision. 

Using  Equation  (12.26)  as  the  basis  for  an  analogy,  we  may  calculate  the  precision  with 
which  a number  of  other  parameters  can  be  estimated.  The  results  are  compiled  in  Table  12-1.  In 
almost  every  instance  we  find  that  mathematically  (if  not  physically)  precision  (&)  can  be  related 
to  resolution  (Ax)  by  the  square  root  of  CNR. 

Ax 

Sx-  , (12.27) 

^2CNR 

This  is  an  extremely  useful  result,  because  it  is  one  of  the  few  equations  that  links  system  level 
performance  parameters  (Sc  and  Ax)  to  a component  level  performance  parameter  (CNR). 


Table  12-1.  Precision  (5x)  versus  resolution  (Ax)  for  a number  of  commonly  used  parameters. 
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Discrimination 


The  term  discrimination  has  many  meanings.  One  of  the  common  uses  is  the  determination 
of  whether  an  object  is  a true  target  or  a decoy  based  on  estimates  of  some  of  the  object’s 
parameters.  For  example,  decoys  tend  to  decelerate  faster  than  true  targets.  Thus,  an  estimate  of 
acceleration  might  be  used  to  declare  the  object  as  target  or  as  decoy.  The  problem  with  this  simple 
approach  is  that  there  is  seldom  a clear-cut  dividing  line  between  the  properties  of  decoys  and  the 
properties  of  targets.  Some  targets  might  decelerate  faster  than  some  decoys  do.  Furthermore,  we 
cannot  measure  the  deceleration  with  perfect  precision.  Measurement  precision  will  give  a spread 
to  the  values  in  a class  even  if  there  were  normally  not  such  a spread  in  the  absence  of  measurement 
“noise”.  Any  overlap  between  values  that  the  two  classes  may  possess  makes  the  assignment  to  a 
specific  class  subject  to  errors.  Let  us  consider  two  classes  each  having  a distribution  of  allowable 
values  of  some  parameter  (such  as  deceleration)  as  shown  in  Figure  12-6.  Assume  that  the 
distributions  of  allowable  parameter  values  are  Gaussian  (the  normal  probability  distribution).  That 
is,  the  probability  of  having  value  y is  given  by 


1 


(12.28) 


is  the  mean  of  the  distribution  for  class  n and  cr/  is  the  variance  of  the  distribution  of  that  class. 


Figure  12-6.  Errors  in  discrimination  between  two  object  classes. 
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Two  kinds  of  errors  can  be  made  in  assigning  a target  to  one  of  two  class.  Objects  in  Class 
1 can  be  mistakenly  assigned  to  Class  2 (called  a Type  I error)  and  objects  in  Class  2 can  be 
mistakenly  assigned  to  Class  1 (called  a Type  II  error).  If  a discriminant  “function”  is  picked  such 
that  the  class  is  assigned  depending  on  the  value  of  that  function  relative  to  the  estimated  target 
parameter,  we  have  the  basis  for  an  automatic  discrimination  system.  In  the  case  of  our  one- 
dimensional example  above,  the  discriminant  function  is  a threshold  value  y = X.  If  the  estimated 
parameter  value  is  greater  than  X,  the  object  is  assigned  to  Class  2.  If  the  estimated  parameter  value 
is  less  than  X,  the  object  is  assigned  to  Class  1.  For  this  discriminant  function,  the  probability  of 
making  a Type  I error  is  given  by  the  integral 
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while  the  probability  of  making  a Type  II  error  is  given  by  the  integral 
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These  equations  can  be  cast  in  slightly  simpler  form  by  transforming  the  integration  variable  from 

y toz=y/cr„ 
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(12.32) 


where  we  have  defined  the  integral  over  the  tail  of  the  normal  probability  distribution  as  Q{x) 

= (12.33) 


Note  that  we  have  also  used  the  symmetry  of  the  integral  about  the  mean  value  to  make  the  forms 

of  the  integral  equal  in  Equations  (12.31)  and  (12.32). 

If  Type  I and  Type  II  errors  are  equally  bad  then  the  optimum  solution  is  to  choose  X such 
that  p^X)  and  pjjX)  are  equal.  This  minimizes  the  total  error  probability.  In  this  instance  we  have 
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(12.34) 
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There  are  instances  in  which  Type  I and  Type  II  errors  are  not  equally  bad  and  there  are  other 
instances  in  which  they  are  equally  bad.  For  example,  it  is  much  more  serious  to  classify  an 
incoming  nuclear  reentry  vehicle  as  a decoy  (and  ignore  it)  than  it  is  to  classify  a decoy  as  a reentry 
vehicle  (and  attack  it)  if  there  is  an  ample  supply  of  interceptors  available.  However,  if  there  is  only 
one  interceptor  available  per  incoming  reentry  vehicle,  then  either  type  of  error  becomes  equally  bad 
(whether  you  ignore  the  real  warhead  or  attack  the  decoy  and  preclude  attacking  a real  warhead  you 
still  have  a warhead  that  is  not  attacked). 

If  we  substitute  the  expression  for  X^g  into  the  probability  expressions  we  find  that 

//2Cri+//iCT2 
0-1+ 0-2 

where  we  have  defined  a factor  K such  that 

^ ^^2.36) 

(71+0-2 

Since  this  factor  relates  the  separation  of  the  means  in  terms  of  the  standard  deviations,  it  is  clearly 
related  to  discriminability.  The  discriminability  factor  {K)  used  here  differs  slightly  from  that  used 
in  other  less  general  work  by  roughly  a factor  of  2.  This  other  work  [6]  defines  K using  only  one 

of  the  standard  deviations  (or  assumes  the  two  standard  deviations  are  equal).  In  that  work  (the 

prime  is  added  to  distinguish  this  result  from  that  of  Eq.  (12.36)  and  is  absent  in  the  original)  is 
given  by 

(12.37) 

(7 

If  the  standard  deviations  are  roughly  equal,  the  data  in  Figure  12-7  may  be  used  by  assuming 

K'  = KH.  (12.38) 

For  example,  the  = 1.5  line  corresponds  to  = 3.0  in  the  earlier  work. 


[cT.+CTi)  (12.35) 


Pi{^5o)  = Pii(^5o)=Q 
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Figure  12-7.  Relation  of  Type  I and  Type  II  errors  to  the  discriminability  factor  K. 


In  Figure  12-7  we  have  plotted  the  values  of p^X)  and  PjJiJC)  for  several  half-integer  values 
of  K.  The  lines  of  -45°  slope  show  the  probability  variations  for  specific  values  of  K.  These  show 
how  Type  I errors  may  be  traded  against  Type  II  errors  and  vice  versa.  The  line  of  +45°  slope  is  the 
line  for  equal  probability,  \.c.,p  = Q{K).  A value  of  1 .5  will  keep  the  probabilities  of  both  kinds 
of  error  less  than  10%  while  a value  of  > 2.5  will  keep  the  probabilities  of  both  kinds  of  error 
below  1%. 

The  analysis  above  is  one-dimensional,  that  is,  a single  parameter  is  used  to  discriminate 
between  the  classes.  This  analysis  can  be  extended  to  discrimination  based  on  multiple  parameters, 
although  we  will  not  do  so  here.  In  some  instances,  an  appropriate  linear  combination  of  the 
parameters  can  be  chosen  that  reduces  the  multi-dimensional  discrimination  problem  back  to  a one- 
dimensional problem  and  the  preceding  analysis  can  be  used.  When  the  number  of  parameters  being 
used  for  discrimination  and/or  the  number  of  classes  becomes  large,  the  process  becomes  what  is 
commonly  called  statistical  pattern  recognition. 
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Pattern  Recognition 

The  principal  components  of  the  statistical  pattern  recognition  process  are  summarized  in 
Figure  12-8.  Sensor  data  are  processed  through  a variety  of  operations  to  extract  key  elements  of 
information  called  features.  Parameter  estimation  plays  a role  in  establishing  values  for  many 
features  or  the  components  from  which  a feature  is  calculated.  Features  may  be  related  to  physical 
characteristics  - brightness,  color,  size,  weight,  etc. ; waveform  characteristics  - pulse  duration,  pulse 
repetition  frequency,  modulation  type,  jitter  in  pulse  duration,  jitter  in  pulse  repetition  rate,  etc.;  or 
image  characteristics  - area,  circumference,  aspect  ratio,  texture,  moments  of  the  intensity 
distribution,  etc.  The  collection  of  values  corresponding  to  all  (total  number  of  features  = N)  of  the 
identified  features  forms  a feature  vector  in  an  N-dimensional  feature  space.  When  numerous 
known  examples  from  the  various  target  classes  (total  number  of  classes  = M)  are  observed  and 
processed,  the  feature  vectors  from  all  examples  of  a target  class  tend  to  cluster  in  a region  of  feature 
space  that  is  different  from  the  regions  where  examples  from  other  target  (or  decoy  or  clutter) 
classes  tend  to  cluster.  Through  the  process  of  training  the  classifier  (through  repeated  observations 
of  known  targets,  decoys,  and  clutter  objects),  the  feature  space  can  be  divided  into  at  least  M 
regions  separated  by  N-dimensional  surfaces  called  decision  surfaces.  Ideally  all  objects  belonging 
to  one  class  would  fall  into  one  and  only  one  of  the  M regions  and  all  objects  belonging  to  other 
classes  would  fall  into  other  regions.  An  unknown  obj  ect  would  then  be  assigned  a class  designation 
based  on  which  region  of  feature  space  into  which  its  feature  vector  fell.  This  clear-cut  situation  is 
illustrated  in  the  left-hand  part  of  Figure  12-9.  Unfortunately  the  real  world  is  seldom  so  simple. 
Objects  belonging  to  one  class  do  tend  to  cluster  in  feature  space,  but  the  N-dimensional  probability 
distribution  will  have  some  overlap  with  the  probability  distributions  corresponding  to  other  objects. 
The  similarity  to  the  two-class  discrimination  problem  presented  in  the  preceding  section  should  be 
obvious.  In  the  training  mode  each  decision  surface  is  adjusted  to  minimize  the  total  number  of 
errors  of  Type  I and  Type  II  for  every  pair  of  target  classes.  In  a multiple  class  problem  it  is 
assumed  that  no  one  class  is  more  important  than  any  other  class.  The  decision  surfaces  may  be 
piecewise  planar  or  quadratic  (paraboloidal)  in  nature.  If  piecewise  planar  decision  surfaces  are 
assumed,  then  pair-wise  between  classes,  the  problem  usually  reduces  to  the  simple  discrimination 
problem.  The  analysis  for  quadratic  decision  surfaces  is  somewhat  more  complicated  and  is 
reserved  for  texts  on  pattern  recognition.[7]-[9] 


Figure  12-8.  The  operation  of  a statistical  pattern  recognizer.  The  classification  step  has 
a training  mode  in  which  known  inputs  are  processed  and  decision  surfaces  established  and 
an  assignment  mode  in  which  unknown  objects  are  classified  based  on  their  feature  vectors 

and  the  decision  surfaces. 
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Figure  12-9.  Ideal  and  realistic  clustering  of  objects  in  feature  space 
with  decision  surfaces  located  as  indicated. 


It  might  seem  to  be  a given  that  it  would  be  easy  to  define  enough  features  to  permit  target 
classification  with  minimal  errors.  As  hinted  at  in  Figure  12-10,  there  is  a fundamental  belief  that 
if  N-1  features  do  not  provide  the  ability  to  truly  discriminate  between  classes,  then  adding  the  A* 
feature  may  finally  permit  the  necessary  discrimination.  This  has  fueled  the  development  of  larger 
and  larger  classifiers  capable  of  handling  expanded  feature  spaces.  It  has  fanned  the  creative  spark 
in  defining  ever  more  clever  features  to  be  extracted.  It  has  also  led  to  the  expenditure  of  vast  sums 
to  assemble  multisensor  systems  capable  of  providing  still  more  features  to  incorporate  into  the 
feature  space. 

Unfortunately  reality  has  not  always  followed  prediction.  [10]  Although  improvements  in 
automatic  target  recognition  have  occurred,  few  systems  have  performance  adequate  for  incorpora- 
tion into  deployed  systems.  There  are  a number  of  reasons  for  this.  First,  not  all  features  provide 
the  high  degree  of  clustering  desired.  The  features  may  lack  discriminability.  It  is  possible  to  define 
mathematical  features  (for  example  the  eleventh  moment  of  an  intensity  distribution)  that  may  have 
no  connection  to  any  physically  observable  property.  It  may  have  more  correlation  with  sensor 
noise 


Figure  12-10.  How  adding  an  additional  feature  may  permit  discrimination  between  classes. 
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than  with  some  fundamental  target  property.  Other  features  are  overly  sensitive  to  environmental 
or  operational  ehanges.  For  example,  the  location  and  intensity  of  the  brightest  spot  in  a thermal 
image  may  depend  on  whether  the  sun  is  shining,  the  direction  of  the  solar  illumination,  whether  the 
engine  is  running,  or  how  many  rounds  its  main  weapon  has  fired.  Such  sensitivity  significantly 
broadens  a potentially  useful  feature.  Lastly,  there  may  not  be  sufficient  information  present  in  the 
data  to  permit  adequate  discrimination.  As  is  discussed  in  the  next  chapter,  there  appears  to  be  a 
correlation  between  extractable  information  content  in  “images”  and  the  ability  of  humans  to 
perform  perceptual  functions  such  as  target  classification.  If  the  signal  used  as  the  input  to  the 
classifier  does  not  contain  enough  information  to  permit  classification  to  be  accurately  performed, 
then  no  amount  of  processing  or  no  number  of  additional  features  will  ever  create  the  missing 
information.  To  date,  very  little  study  of  the  information  content  provided  to  target  recognizers  has 
been  performed  to  determine  how  well  they  should  perform  given  the  inputs  they  are  being  provided. 
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Problems 


12-1.  Compare  accuracy,  precision,  and  resolution. 

12-2.  You  are  the  marketing  representative  for  a radar  manufacturer.  You  wish  to  express  the 
angular  resolution  of  your  radar  in  the  best  possible  light.  What  resolution  criterion  are  you 
likely  to  use  to  calculate  the  angular  resolution?  Give  both  the  criterion  name  and  the 
explicit  expression  for  the  resolution. 

12-3.  A sensor  has  a point  source  response  that  is  triangular.  That  is, 

G(x)  = l-(|x|A:(,)  for  |x|  < 

= 0 for  |x|  > Xf) 

Evaluate  the  resolution  function  for  this  point  spread  function  and  develop  an  expression  for 
the  Rayleigh  criterion  of  this  point  spread  function  in  terms  of  X(,. 

12-4.  If  an  angular  resolution  criterion  is  assumed  to  have  the  form  a = x.xx  XfD,  what  are 
reasonable  values  of  x.xx  (pick  a specific  criterion)  for: 

1)  a thermal  imager 

2)  a microwave  radar 

3)  a laser  radar 

4)  a television  imager 

5)  a passive  sonar 

6)  a RF  interferometer 

7)  a laser  rangefinder? 

EXTRA  CREDIT 

8)  an  active  sonar 

9)  image-intensified  optics 

10)  a synthetic  aperture  radar 

12-5.  How  is  the  precision  of  a measurement  related  to  the  resolution  or  the  characteristic  value 
of  the  parameter  being  measured? 

1 2-6.  A spectrum  analyzer  designed  to  measure  the  frequency  of  a radio  emission  has  a frequency 
resolution  Av.  What  expression  describes  the  frequency  measurement  precision  5v  of  this 
spectrum  analyzer? 

12-7.  An  application  requires  angular  resolution  better  than  0.8  ^/D  and  an  even  finer  precision 
of  0.01  A,/D.  Which  sensor  defined  by  what  resolution  criteria  can  achieve  this:  i)  a laser 
radar  with  Sparrow  criterion,  ii)  a passive  infrared  detection  system  with  Sparrow  criterion, 
or  iii)  a laser  radar  with  Rayleigh  criterion.  What  is  the  relation  between  resolution  and 
precision?  What  must  be  done  to  the  sensor  chosen  above  to  satisfy  both  criteria? 
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12-8.  A new  antiship  missile  defense  system  is  required  to  track  missiles  with  an  active  300  GHz 
seeker  head.  The  system  requires  an  angular  resolution  of  less  than  0.005  radians.  If  two 
incoming  missiles  are  side-by-side  and  separated  by  35  meters  at  what  range  will  they 
become  resolved.  Utilizing  the  Rayleigh  criterion  determine  the  effective  antenna  size. 
Additionally,  the  system  requires  an  angular  precision  of 0.0005  radians.  Estimate  the  CNR. 

12-9.  A millimeter- wave  and  a microwave  radar  are  candidates  for  the  tracking  radar  of  a new 
class  of  ship.  The  millimeter-wave  radar  operates  at  220  GHz  and  has  an  antenna  size  of  55 
cm.  The  microwave  radar  operates  at  17  GHz  and  has  an  antenna  size  of  2.3  m.  What  is  the 
maximum  range  at  which  each  candidate  can  discriminate  between  two  targets  20  m apart. 
Assume  a Rayleigh  criterion  for  resolution  and  that  atmospheric  effects  and  radar  power 
limitations  are  negligible.  If  the  millimeter- wave  radar  has  10  dB  less  CNR  at  its  maximum 
range  for  resolution  of  20  m separation  targets  than  the  microwave  radar  at  its  maximum 
resolution  range  , what  is  the  relative  precision  of  the  radars  at  those  ranges. 

12-10.  What  effect  do  the  following  have  on  sensor  angular  resolution?  on  sensor  angular 
precision?  Assume  only  the  parameter  listed  is  allowed  to  change. 

a)  increasing  carrier  frequency 

b)  increasing  CNR 

c)  increasing  signal  bandwidth  (with  no  increase  in  signal  power) 

Calculate  the  aperture  required  for  a 10.6  pm  laser  radar  to  achieve  0.00001  radians 
resolution  assuming  the  Gaussian  FWHM  criterion. 

12-11.  Two  objects  are  to  be  discriminated  on  the  basis  of  temperature.  Object  1 is  described  by 
a distribution  with  //;  = 270  K and  cr,  = 5 K.  Object  2 is  described  by  a distribution  with 
= 300  K and  = 10  K.  What  is  the  best  error  performance  that  can  be  expected? 

12-12.  A pattern  recognizer  has  a feature  space  in  which  the  region  for  one  target  class  (Class  A) 
is  bordered  by  regions  for  five  other  classes.  Assuming  the  features  are  completely 
independent  and  each  pair  of  features  has  5%  Type  I and  5%  Type  II  errors,  what  is  the 
probability  of  correct  classification  and  the  probability  of  false  alarms  for  objects  of  Class 
A given  that  Class  A objects  are  5 times  as  numerous  as  the  objects  of  each  of  the  other  five 
classes.  The  probability  of  correct  classification  is  the  number  of  A objects  correctly 
classified  as  A objects  divided  by  the  total  number  of  A objects.  The  probability  of  false 
alarm  is  the  number  of  objects  falsely  classified  as  class  A divided  by  the  total  number  of 
objects  classified  as  A objects. 
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CHAPTER  13 


MODULATION  AND  DEMODULATION 


Modulation  and  Demodulation 

Information  is  often  carried  by  modulations  (distinct  variations  of  some  wave  characteristic 
such  as  frequency  or  amplitude)  imposed  on  sinusoidal  waves  of  otherwise  constant  character.  The 
waves  upon  which  the  modulation  is  imposed  are  called  carrier  waves.  Without  modulation  the 
carrier  is  usually  constant  and  non-varying.  The  signal  is  contained  within  the  modulation  imposed 
upon  the  carrier.  It  is  this  association  with  constant  non-varying  waves  from  which  the  term 
“carrier-to-noise  ratio”  used  in  Chapter  1 1 derives.  The  term  modulation  refers  to  both  the  process 
of  imposing  variations  (modulation)  on  a carrier  and  to  the  resulting  variation  themselves. [1],  [2] 
Demodulation  is  the  process  of  extracting  the  variations  from  the  received  radiation  and  converting 
them  into  signals  useable  by  the  sensor  system.  Modulators  impose  modulation  on  carriers; 
demodulators  extract  modulation  from  modulated  carriers. 

Modulation  and  demodulation  are  seldom  end  functions  of  sensors  but  one  or  the 
other  or  both  are  often  critical  intermediates  in  the  accomplishment  of  desired  end  functions  such 
as  parameter  estimation  or  tracking.  In  some  instances  the  sensor  will  perform  both  modulation  and 
demodulation.  For  example,  a radar  may  transmit  a frequency-modulated  waveform  and  receive  a 
modified  frequency-modulated  waveform  in  the  pursuit  of  estimating  the  range  and  velocity  to  a 
target.  In  others,  one  sensor  will  establish  a modulation  for  another  sensor  to  utilize,  such  as  a coded 
laser  designator  signal  for  a laser  guided  weapon.  In  others  the  interaction  of  a sensor  signal  with 
the  target  may  produce  a modulation  that  is  useful  for  another  function,  as  in  a conical  scan  tracking 
system. 


Many  types  of  modulation  can  be  imposed  on  either  acoustic  or  electromagnetic  waves. 
These  include:  amplitude  modulation,  frequency  modulation,  and  phase  modulation.  A fourth 
kind  of  modulation,  polarization  modulation  (switching  back  and  forth  between  two  orthogonal 
states  of  polarization)  can  be  imposed  only  on  electromagnetic  waves.  Amplitude,  frequency,  and 
phase  modulation  are  shown  graphically  in  Figure  13-1.  Amplitude,  frequency,  phase,  or 
polarization  modulation  may  be  pure  analog  techniques,  in  which  the  signal  is  continuously  variable 
in  both  time  and  “amplitude”,  or  they  may  be  implemented  as  discrete  techniques  in  which  the 
signal  is  sampled  in  time  and  possibly  the  “amplitude”  is  quantized  into  discrete  allowable  values. 
Discrete  techniques  must  be  sampled,  but  do  not  have  to  be  quantized.  A continuum  of  outputs  is 
possible.  Techniques  in  which  both  sampling  and  quantization  are  used  are  called  digital 
techniques.  The  discrete  levels  may  or  may  not  be  encoded  into  binary  or  other  formats  for 
processing  by  electronic  computers.  Figure  13-2  compares  an  analog  signal  with  one  of  many 
digital  representations  of  that  signal.  The 


389 


Figure  13-1.  Some  signal  modulation  schemes. 
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Figure  13-2.  Comparison  of  “Analog”  versus  “Digital”  signals. 
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key  difference  between  analog  and  digital  lies  in  the  continuous  versus  the  doubly  discrete  character 
of  the  signals  being  handled. 

Amplitude  modulation,  phase  modulation,  and  frequency  modulation  are  subtly  related.  All 
three  involve  time-dependent  variations  of  the  signal.  We  can  represent  a general  signal  as 

S[t)  - A{t)  cos[o(^)]  = A{t)  cos[ty + (^(0]  (13.1) 

where  A{t)  represents  any  amplitude  variation,  0(t)  is  the  total  phase  of  the  signal,  (Oc  is  the  carrier 
frequency  of  the  signal,  and  (jit)  is  variable  part  of  the  phase.  The  instantaneous  frequency  of  this 
signal  may  be  defined  as 

/ X 50  d(j) 

— . (13.2) 

ot  ot 

Let  us  now  attempt  to  modulate  this  signal  by  a modulation  function  m{f).  If 

A{t)  = km{t),  (13.3) 

where  A:  is  a scaling  factor,  then  we  have  amplitude  modulation.  Alternatively,  if  we  allow  the 
variable  phase  to  follow  the  modulation  function,  i.e., 

m(t),  (13.4) 

where  (/)()  is  an  arbitrary  initial  phase  and  A:  is  a scaling  factor,  then  we  have  phase  modulation.  The 
instantaneous  frequency  is 


co[t)  - CO^  + k 


dm{t) 


dt 


(13.5) 


Consequently,  phase  modulation  implies  frequency  variations.  For  this  reason,  phase  modulation 
is  sometimes  mis-identified  as  frequency  modulation.  If  we  integrate  the  modulation  function  over 
time  to  produce  an  altered  modulation  function  m ’(t) 


m 


(13.6) 


and  let  the  variable  phase  follow  this  integrated  modulation, 

^(a)  = km'{t) 


(13.7) 
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then  the  instantaneous  frequency  is  found  to  be 


Co{t)  — C0(.  + k — ~ + km[t) 


(13.8) 


The  frequency  varies  directly  with  the  modulation  function.  This  is  true  frequency  modulation. 
The  difference  between  frequency  and  phase  modulation  is  essentially  a single  integration  of  the 
modulation  function. 

Another  class  of  modulation,  called  pulse  modulation,  is  applicable  in  sampled  waveform 
scenarios.  The  information  in  a sampled  signal  can  be  represented  (or  encoded)  by  varying  the 
amplitude,  duration,  position  (relative  to  a time  reference),  or  repetition  rate  of  the  transmitted 
pulses.  The  way  in  which  information  is  encoded  by  several  types  of  pulse  modulation  are 
illustrated  in  Figure  13-3.  The  most  common  forms  of  pulse  modulation  are  essentially  forms  of 
amplitude  modulation.  However,  frequency  or  polarization  modulation  could  also  be  used  to  encode 
the  signal  amplitude  modulation. 


Figure  13-3.  Several  pulse  modulation  schemes. 


MODULATION  'ENVELOPE' 


PULSE  AMPLITUDE  MODULATION  (PAM) 


PULSE  DURATION  MODULATION  (PDM) 


PULSE  POSITION  MODULATION  (PPM) 


PULSE  RATE  MODULATION  (PRM) 
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Related  to  but  not  a true  form  of  pulse  modulation  is  pulse  code  modulation  (PCM).  PCM 
is  a truly  digital  modulation  technique.  In  PCM  the  signal  is  sampled,  the  sampled  signal  is 
quantized  into  two  more  discrete  levels,  and  the  quantized  level  is  transmitted  as  a unique  string  of 
bits,  the  bits  having  been  encoded  into  pulses.  Pulse  amplitude,  position,  duration,  pulse  rate, 
frequency,  polarization,  phase,  and  polarity  among  others  are  capable  of  being  used  to  encode  the 
bits. 


In  the  following  sections  each  of  the  major  modulation  schemes  will  be  examined  in  more 
detail.  The  nature  of  the  modulation  is  discussed  as  are  any  peculiarities  or  distinctive 
characteristics  it  may  possess.  Modulation  and  demodulation  hardware  is  also  discussed. 
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Amplitude  Modulation  (AM) 

Amplitude  modulation  (AM)  is  obtained  by  multiplying  a low  frequency  modulation  onto 
an  otherwise  constant  carrier  wave.  The  modulation  can  be  achieved  by  varying  the  gain  or  loss  of 
a component  in  the  electronics.  An  amplitude  modulated  signal  can  be  expressed  as 

E{t)  = A{1:)Eq  cos(2;t/c?)  (13.9) 

where  A(()  is  the  modulation  waveform  and is  the  carrier  frequency.  We  assume  that  0^(t)<  1 . 
If  the  modulation  is  a simple  sinusoid,  then  the  modulation  factor  can  be  written  as 

A(t)  = 0.5(^1  + a cos(27t (13.10) 

where  0 < a < 1,  is  the  depth  of  modulation  and  is  the  modulation  frequency.  Expansion  of  the 
terms  in  Eq.  (13.10)  yields 

E[t)  =0.5(1  + <2COs(2;^/^?))£'q  cosilTif^t) 


-0.5Eq  cos{27i;f(,t)  + acos{27i;fj^t)cos{27Tf^t)  (13.11) 


cos(2;r/c^)  + 

0.5a[cos(2;r(/c  - /^ » + cos(2;r(/c  + /^ )^)] 

Thus,  as  shown  in  the  upper  trace  of  Figure  13-4,  the  spectrum  of  the  amplitude-modulated  signal 
consists  of  the  carrier  and  two  equal  side-bands  shifted  from  the  carrier  by  ±f^.  The  amplitude  of 
the  sidebands  is  proportional  to  the  depth  of  modulation  but  is  never  greater  that  half  the  carrier 
amplitude.  The  spectrum  of  the  original  baseband  signal  (the  modulation)  is  shown  superimposed 
on  the  AM  signal  at  the  lefthand  edge  of  the  trace.  The  lower  trace  in  Figure  13-4  shows  the 
baseband  and  AM  signal  spectra  in  the  event  of  more  complex  modulation. 

The  total  bandwidth  of  an  AM  signal  is  thus  easily  seen  to  be  twice  the  maximum  modulation 
frequency  in  the  signal 


= 0.5^, 


B = 2f. 


Mmax  * 


(13.12) 
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Figure  13-4.  Spectra  of  baseband  and  amplitude-modulated  signals  for 
pure  sinusoidal  modulation  and  for  more  complex  modulation. 


PURE  SINUSOIDAL  MODULATION 


COMPLEX  MODULATION 


Amplitude  modulation  can  be  detected  (demodulated)  by  mixing  the  modulated  signal  with 
a local  oscillator  that  is  identical  to  the  unmodulated  carrier.  This  process  is  called  superheterodyne 
reception.  The  “dc”  or  baseband  component  of  the  mixer  output  voltage  will  reproduce  the 
modulation.  The  higher  frequency  components  which  are  present  in  the  mixer  output  are  usually 
easily  fdtered  to  give  only  the  modulation  waveform  J(i)  as  the  final  output.  A filtered  diode  is 
also  capable  of  serving  as  a demodulator.  Such  an  arrangement  is  shown  in  Figure  13-5.  The 
modulated  carrier  is  passed  through  a simple  diode  (half-wave  rectifier)  and  then  filtered  by  an  RC 
filter.  The  capacitor  charges  up  to  the  signal  voltage  during  the  positive  half-cycle,  and  decays 
during  the  negative  half-cycle.  The  RC  time  constant  is  set  to  provide  an  intercycle  decay  at  least 
as  large  as  the  maximum  rate  of  change  of  the  AM  envelope  function.  The  sawtooth  voltage  across 
the  capacitor  is  a reasonable  representation  of  the  original  modulation  waveform. 

There  are  some  applications  where  the  presence  of  the  carrier  in  the  transmitted  signal  is  not 
desired.  By  using  a balanced  modulator,  a signal  with  both  sidebands  present,  and  little  or  no 
carrier  can  be  produced.  This  is  called  double-sideband  suppressed-carrier  (DSB-SC)  operation. 
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Figure  13-5.  The  RC-filtered  diode  demodulator  and  an  illustration  of  its  principle  of  operation. 


a)  FILTERED  DIODE  DEMODULATOR 


b)  DEMODULATOR  OUTPUT 


The  balanced  modulator  is  shown  in  Figure  13-6.  By  adding  a standard  modulated  signal  to  the 
modulated  signal  produced  by  first  inverting  the  carrier  and  the  modulation,  the  carrier  can  be 
canceled  out  without  affecting  the  sidebands. 


Figure  13-6.  The  balanced  modulator. 
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Communication  applications  often  prefer  to  cancel  both  the  carrier  and  one  of  the  sidebands. 
This  single-sideband  (SSB)  mode  of  operation  frees  up  frequency  bandwidth  that  can  be  used  for 
additional  communication  channels.  Single  sideband  is  usually  produced  by  bandpass  fdtering  to 
eliminate  both  the  carrier  and  the  unwanted  sideband.  This  requires  a bandpass  fdter  with  very 
sharp  frequency  limits  and  strong  out-of-band  rejection.  It  may  be  impractical  to  generate  a desired 
SSB  signal  at  a specific  frequency  because  of  limitations  imposed  by  filter  technology. 

NOTE:  it  is  a general  rule  that  the  smaller  the  ratio  of  the  width  of  a filter  passband  is  to  the 
center frequency  of  that  pass  band,  the  more  difficult  (and  usually  more  expensive)  it  is  to  make  that 
filter.  This  true  regardless  of frequency,  whether  radio  frequency,  microwave,  infrared,  or  visible 
radiation.  Filters  with  ratios  of  1%  to  10%  are  relatively  easy  to  obtain.  Filters  with  ratios  ofO.P/o 
are  more  difficult  and filters  with  ratios  <0.01%  are  exceedingly  difficult  (but  seldom  impossible). 

For  example,  filters  may  be  available  that  can  separate  a 10  kHz  baseband  modulation  from 
a 500  kHz  carrier.  However,  filters  may  not  be  available  that  can  separate  the  10  kHz  modulation 
from  a 10  MHz  carrier.  However,  it  is  likely  that  filters  are  available  that  can  separate  a 500  kHz 
modulation  from  a 10  MHz  carrier.  SSB  generation  can  then  be  implemented  in  two  steps: 
modulation  of  a 500  kHz  carrier  followed  by  filtering  plus  subsequent  modulation  of  a 10  MHz 
carrier  followed  by  filtering.  This  is  shown  in  Figure  13-7.  Using  up  to  three  stages  of  modulation 
plus  filtering  SSB  signals  can  be  easily  produced  in  almost  frequency  range. 

The  baseband  signal  (frequency  <%)  may  be  recovered  from  SSB  modulation  (either  cOc  + 
a>M  or  a>c  - cof)  by  mixing  the  SSB  signal  with  the  carrier  signal  (frequency  cof).  The  output  of  the 
mixer  contains  a second  harmonic  term  (either  2 (0^  + cOm  or  2 cOc  - <%)  and  the  desired  baseband 
term  (either  -i-(%  or  -<%).  The  baseband  term  is  then  isolated  by  filtering. 

In  general,  AM  is  extremely  simple  to  produce  and  simple  to  demodulate.  However,  noise 
on  an  amplitude  modulated  signal  is  indistinguishable  from  signal  modulation.  Amplitude 
fluctuations  caused  by  variations  in  the  propagation  channel  (such  as  atmospheric  turbulence  or 
multipath  interference)  likewise  are  indistinguishable  from  signal  modulations.  For  this  reason, 
despite  its  simplicity,  AM  is  usually  used  only  when  high  signal  levels  and  insignificant  fluctuations 
are  expected. 


Figure  13-7.  Multistage  generation  of  a single-sideband  signal. 
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Frequency  Modulation  (FM) 

Frequency  modulation  (FM)  involves  imposing  small  changes  in  the  instantaneous 
frequency  of  a carrier  wave.  This  can  be  accomplished  by  varying  the  frequency  of  an  oscillator  to 
produce  the  original  frequency  modulation  and  then  mixing  this  signal  with  an  unmodulated  carrier 
or  by  directly  frequency  modulating  the  carrier  using  a frequency  (or  phase)  modulator.  Since  the 
instantaneous  frequency  is  modulated,  there  is  a maximum  frequency  fMaximum  ^ minimum 
frequency  /Minimum-  The  modulation  frequency  /Modulation  is  essentially  the  inverse  of  the  time  required 
to  shift  from  maximum  to  minimum  frequency.  An  important  quantity  in  discussing  FM  systems 
is  the  depth  of  modulation  p of  the  signal.  This  is  defined  by  the  relationship 


/?= 


A/  _ A/  _ \f, 
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If  P is  large  enough,  the  carrier  may  not  be  readily  distinguishable.  It  can  be  found  by  the  relation 
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(13.14) 


Consider  the  following  simple  sinusoidal  modulation  function 

m(t)  ^ P / M f M^)  (13-15) 

where  /m  is  the  modulation  frequency.  If  we  consider  next  a simple  sinusoidally  frequency- 
modulated  signal.  The  instantaneous  frequency  is  given  by 

f{t)  = fc  P f M • (13.16) 

In  frequency  modulation  the  phase  function  is  proportional  to  the  time  integral  of  the  modulation 
function 


^(0  = ^^\[dr[fc+PfM  cos(2;r/^r)] 

= 2ti /(.t  -f  2tiP sm{27T /^t) 

Thus,  the  signal  field  can  be  described  by  the  expression 

E{t)  - Eq  cos(2;t/c^ + ;^sin(2;r/^?)) . (13.18) 

This  signal  field  can  in  turn  be  expressed  as 
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(13.19) 


E{t)  = E, 


cos(2;r/(.  t)  cos(  P sin(2;r/^?)) 
- sin(2;r/cO  sin(^  sin(2;r/^r)) 


The  two  terms  involving  /?  can  be  expanded  in  a Fourier  series  in  which  is  the  fundamental 
frequency  and  recombined  with  the  carrier  frequency  to  give 

E{t)  = E\j^{p)Qoil7tfct) 

- J,  (y?)[cos(2;r(/c  - )?)  - cos(2;r(/c  + )t)] 

+ (y?)[cos(2;r(/c  - 2/^  )^)  + cos(2;r(/c  + 2/^ )?)]  (13.20) 

- - 3/m)0“  + 3/m)0] 

+ ••• 


where  the  J„{P)  are  Bessel  functions.  The  frequency  modulated  signal  consists  of  a carrier  of 
relative  amplitude  JgiP)  and  a set  of  symmetrically  placed  sidebands  at  frequency  separations  of  n/^, 
etc.  If  p becomes  large,  then  almost  all  of  the  energy  is  contained  in  the  sidebands,  and  little  is 
contained  in  the  carrier.  The  effective  bandwidth  B of  the  FM  signal  can  be  shown  to  be 

S = 2(/?+l)/„=2(A/  + /„)  (13.21) 


a relation  known  as  Carson’s  Rule. 

Frequency  modulation  can  be  produced  in  a variety  of  ways.  One  of  the  more  common 
techniques  is  Edwin  Armstrong’s  (Armstrong  was  one  of  the  founders  of  modem  radio  along  with 
Marconi,  de  Forest,  and  Samoff).  This  technique  combines  an  integrator  with  a phase  modulator 
(see  the  next  section)  to  produce  a frequency  modulator.  Armstrong’s  system  is  shown 
schematically  in  Figure  13-8.  The  mathematics  of  the  signal  generation  can  be  found  in  the  next 
section  on  phase  modulation. 

The  basic  concept  of  FM  demodulation  is  shown  in  Figure  13-9.  The  FM  signal  is  input  to 
a frequency  selective  network.  A frequency  selective  network  is  any  electronic  circuit  or  element 
that  gives  a response  (output)  that  varies  with  frequency.  Almost  every  real  circuit  acts  as  a 
frequency  selective  network  to  some  extent.  The  frequency  selective  network  converts  the 
frequency  modulation  into  an  amplitude  modulation.  The  amplitude  modulation  is  then 
demodulated  using  any  standard  AM  demodulator  such  as  the  diode  demodulator. 
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Figure  13-8.  Armstrong  frequency  modulation  technique. 


Figure  13-9.  Basic  principle  of  FM  demodulation. 
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EXAMPLE:  SIMPLE  RC  HIGH-PASS  FILTER 


A common  form  of  FM  signal  demodulator  is  the  limiter-plus-frequency  discriminator 
(commonly  called  a limiter-discriminator  and  shown  schematically  in  Figure  13-10).  The  limiter 
truncates  the  peaks  of  the  analog  signal  to  a constant  level  (eliminating  the  effects  of  any  amplitude 
modulation).  The  frequency  discriminator  is  a mixer-based  component  whose  output  voltage  is 
proportional  to  the  input  frequency.  [3]  It  converts  an  FM-modulated  amplitude-limited  carrier 
directly  into  an  AM  voltage  signal  that  is  easily  processed.  FM  signals  are  strongly  resistant  to 
degradation  from  noise  and  from  channel  fluctuations.  For  this  reason  it  is  preferred  in  systems 
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where  low  signal-strengths  are  expected.  As  long  as  the  CNR  is  much  greater  than  unity,  then  the 
signal-to-noise  ratio  in  an  FM  system  is  given  by 


SNR  = XSp^CNR  . (13.22) 

If  the  depth  of  modulation  is  large  (greater  than  unity),  the  SNR  can  be  appreciable  even  if  the  CNR 
is  relatively  small. 


Figure  13-10.  Frequency  discriminator  based  on  a double-balanced  mixer. 


NO  AMPLITUDE  MODULATION 


There  is  a practical  limit  to  how  low  CNR  may  go  while  maintaining  the  performance  of 
Eq.(13.22).  Frequency  discriminators  and  other  frequency  demodulators  exhibit  a threshold  effect. 
As  the  threshold  is  approached  from  the  direction  of  high  CNR,  the  noise  in  the  output  rises  rapidly. 
The  noise  takes  the  form  of  spikes  that  yield  clicks  in  the  frequency  demodulator  output.  As  long 
as  the  CNR  is  well  above  (greater  than  3 dB)  the  threshold  level,  then  the  frequency  demodulator 
can  maintain  a lock  on  the  signal.  Typical  thresholds  are  of  the  order  of  CNR=10. 
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Phase  Modulation  (PM) 

AM  and  FM  can  be  used  to  transmit  analog  or  digital  information.  Phase  modulation  (PM) 
is  usually  used  to  transmit  digital  information.  A phase  shifter  is  used  to  impose  discrete  phase 
shifts  of  either  0 or  1 80  degrees  onto  a carrier  at  well-defined  intervals  (time  difference  equal  to  one 
over  the  modulation  frequency).  A “0"  bit  might  be  represented  by  0 degrees  absolute  phase  (no 
shift)  while  a “1"  bit  might  be  represented  by  a 1 80  degrees  absolute  phase.  Differential  phase  shift 
modulation  is  a variant  of  PM  that  involves  changing  the  modulation  only  when  bits  change  from 
“0"  to  “1"  or  from  “1"  to  “0"  in  a string  of  digital  data.  The  instantaneous  phase  of  the  carrier  can 
be  determined  by  a phase  discriminator  (a  device  that  works  very  similarly  to  a frequency 
discriminator).  Phase  modulation  is  also  highly  resistant  to  noise  and  channel  fluctuation  effects. 

The  Armstrong  phase  modulation  technique  is  shown  in  Figure  13-11.  First,  the  carrier  is 
split  into  two  equal  parts.  One  part  is  then  modulated  by  the  modulation  signal  m{f)  using  a simple 
balanced  modulator.  For  carrier  signal 

E{t)  = A^smco^t  (13.23) 

and  modulation  signal 

m[t)-/^smcOj^t  (13.24) 

the  output  of  the  balanced  modulator  is 

Q.5E{t)m{t)  = Q.5A^P  sin<z>^?  . (13.25) 


Figure  13-11.  Armstrong  phase  modulation  system. 
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The  second  part  of  the  carrier  is  shifted  in  phase  by  90°  and  then  added  to  the  negative  of  the 
output  of  the  balanced  modulator.  The  sum  yields 

Eourit)  = 0.5 sin -0.5 sinty^?  sinco^f 

= 0.5  A co^co ct  + P sin  CO  j^i\  (13.26) 

= 0.5  A(.  cos[ty(./  + m{t)^ 

which  is  obviously  a phase-modulated  signal. 

Demodulation  of  phase-modulated  signals  is  accomplished  with  phase  detectors.  The 
simplest  phase  detector  is  a doubly-balanced  mixer  in  which  the  phase-modulated  signal  is  input  to 
the  RF  port  and  a signal  with  the  same  average  frequency  but  no  phase  modulation  is  input  to  the 
LO  port.  The  output  of  the  IF  port  is  a signal  proportional  to  the  phase  difference  between  the  RF 
and  LO  signals.  In  many  systems  there  is  an  arbitrary  total  phase  accumulation  that  depends  on  the 
exact  circumstances  of  the  signal  propagation.  This  phase  is  not  compensated  in  the  simple  mixer. 
However,  if  a period  of  known  zero  phase  shift  is  contained  in  the  signal,  then  the  overall  phase  can 
be  eliminated. 

Since  phase  and  frequency  modulation  are  intimately  related  it  is  possible  to  use  a frequency 
demodulator  to  recover  the  phase  modulation.  If  a phase-modulated  signal  is  processed  by  a 
frequency  demodulator,  the  output  will  be  proportional  to  the  derivative  of  the  modulation  function. 
Thus,  the  integral  of  the  output  of  a frequency  demodulator  gives  the  original  phase  modulation. 
By  adding  an  integrator  to  the  output  of  a frequency  demodulator  we  obtain  a phase  demodulator. 
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Pulse  Modulation 


Pulse  modulation  is  the  generic  class  of  modulation  schemes  in  which  the  modulation  signal 
m{t)  to  be  transmitted  is  encoded  in  the  form  of  pulses.  The  signal  is  sampled  at  a regular  interval 
and  the  measured  amplitude  of  m at  each  sample  point  is  used  to  select  a particular  value  of  some 
parameter  of  a train  of  pulses.  For  example,  the  values  of  m may  be  used  to  determine  the 
amplitudes  of  the  pulse  heights  of  the  pulse  train.  The  frequency  of  the  pulse  or  the  pulse  duration 
or  the  exact  timing  of  the  start  of  each  pulse  is  irrelevant  to  the  encoded  information.  Only  the  pulse 
amplitude  carries  information  about  m{t).  This  form  of  pulse  modulation  is  called  pulse  amplitude 
modulation  (PAM).  NOTE:  pulse  modulation  is  a discrete  technique  in  that  it  requires  sampling. 
However,  it  may  or  may  not  be  digital  as  there  is  no  requirement  to  quantize  the  encoding.  In  the 
example  of  pulse  amplitude  modulation  above,  there  is  no  constraint  placed  on  the  amplitude.  It 
may  be  continuous  and  exactly  equal  to  the  value  of  m{f)  at  the  time  of  sampling.  On  the  other  hand, 
it  may  be  quantized.  This  is  a choice  left  to  the  system  designer. 

As  indicated  in  Figure  1 3-3,  virtually  any  parameter  of  the  pulses  may  be  used  to  encode  the 
data  independent  of  the  other  parameters.  The  sampled  values  of  m(t)  may  be  used  to  determine  the 
length  (duration)  of  each  pulse.  This  produces  what  is  known  as  pulse  duration  modulation 
(PDM)  or  pulsewidth  modulation  (PWM).  The  sampled  values  of  m(t)  may  be  used  to  determine 
the  time  position  (relative  to  some  reference  - often  the  time  at  which  each  sample  is  obtained)  at 
which  the  pulse  is  transmitted.  The  longer  the  time  delay,  the  larger  the  amplitude  of  m being 
represented.  This  form  of  pulse  modulation  is  called  pulse-position  modulation  (PPM).  The  pulse 
repetition  frequency  (PRF)  of  a very  high  rate  pulse  train  (many  pulses  are  transmitted  per  sampling 
interval)  may  be  use  to  encode  the  sampled  value  of  m(t).  This  form  of  pulse  modulation  is  called 
pulse  rate  modulation  (PRM).  The  four  techniques  mentioned  so  far  are  illustrated  in  Figure  1 3-3 . 
There  are  at  least  two  others  that  warrant  mention  but  cannot  be  depicted  on  the  same  figure.  One 
of  these  is  pulse  frequency  (or  wavelength)  modulation.  In  this  encoding  scheme  the  frequency 
of  the  transmitted  radiation  making  up  the  pulse  is  varied  in  proportion  to  the  sampled  value  of  m(t) . 
This  form  of  pulse  modulation  can  be  considered  to  a form  of  sampled  frequency  modulation.  A 
closely  related  form  of  pulse  modulation  encodes  the  information  on  m(t)  into  the  polarization  angle 
of  the  radiation  transmitted  in  the  pulse.  This  is  called  pulse  polarization  modulation. 

The  author  believes  this  list: 

* pulse  amplitude  modulation  (PAM) 

* pulse  duration  modulation  (PDM)  or  pulselength  modulation 

* pulse-position  modulation  (PPM)  or  pulse  timing  modulation 

* pulse  rate  modulation 

* pulse  frequency  modulation  or  pulse  wavelength  modulation 

* pulse  polarization  modulation 

to  be  comprehensive.  There  are  a very  limited  number  of  fundamental  properties  of  pulses.  One 
might  be  tempted  to  add  pulse  shape  modulation  where  the  shape  varies  from  rectangular  to 
triangular  or  some  other  shape  as  m varies.  The  author  thinks  this  would  terribly  difficult  to 
implement  and  is  not  likely  to  be  attempted.  One  might  consider  varying  the  phase  of  the 
transmitted  waveform.  However,  the  frequency  of  radiation  in  a pulse  is  easily  determined,  but 
phase  is  not.  For  example,  the  phase  depends  on  the  separation  of  the  transmitter  and  receiver. 
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There  is  no  reference  from  which  the  applied  phase  (proportional  to  m(t))  may  be  determined  from 
the  separation  phase.  In  ordinary  phase  modulation  it  is  changes  in  phase  which  are  measured.  This 
cannot  be  done  in  simple  pulses.  There  are  no  other  unique  properties  of  pulses  that  the  author  can 
identify  that  could  be  used  to  encode  the  signal  m(t). 

Not  all  pulse  modulation  schemes  are  equal.  Some  are  easier  to  implement  than  others.  For 
example,  it  is  easy  to  delay  the  emission  of  a pulse  in  direct  relation  to  the  amplitude  of  m.  At  the 
same  time,  it  may  be  difficult  to  vary  the  wavelength  or  the  polarization  of  the  radiation. 
Conversely,  polarization  or  frequency  modulation  are  not  as  easily  detected  as  time  and  amplitude 
modulation.  This  may  enhance  the  covertness  of  a signal.  Only  those  individuals  intended  to 
receive  the  message  are  likely  to  have  the  necessary  receivers.  Some  schemes  are  more  immune  to 
the  effects  of  noise  and  anomalies  in  the  propagation  medium.  Pulse  amplitude  modulation  is  very 
susceptible  to  both  noise  and  propagation  effects.  Receiver  noise  will  directly  increase  or  reduce 
the  perceived  value  of  m because  it  adds  to  or  subtracts  from  the  pulse  amplitude.  Propagation 
anomalies  (multipath  fading  or  turbulent  scintillation)  will  cause  fluctuation  in  the  received  signal 
strength  that  will  be  perceived  as  real  signal  fluctuations  inm.  On  the  other  hand,  amplitude  noise 
and  propagation  anomalies  will  produce  almost  impact  on  the  pulse  repetition  rate.  Thus,  PAM  is 
far  more  susceptible  to  noise  and  anomalies  than  any  of  the  other  techniques. 
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Pulse  Code  Modulation 


Pulse  code  modulation  (PCM)  is  a truly  digital  modulation  technique.  In  PCM  the  signal 
is  sampled,  the  sampled  signal  is  quantized  into  two  more  discrete  levels,  and  the  quantized  level 
is  transmitted  as  a unique  string  of  bits,  the  bits  having  been  encoded  into  pulses.  As  we  shall  see, 
pulse  amplitude,  position,  duration,  pulse  rate,  frequency,  polarization,  phase,  and  polarity  among 
others  are  capable  of  being  used  to  encode  the  bits.  PCM  is  not  considered  a true  form  of  pulse 
modulation  because  the  amplitude  of  the  modulating  signal  is  not  directly  mapped  onto  some 
parameter  of  simple  pulses,  instead  it  is  mapped  into  a digital  numerical  code. 

Most  PCM  encoding  techniques  use  binary  numbers  to  encode  the  signal.  Figure  13-12 
compares  the  decimal  number  system  with  which  everyone  is  familiar  to  the  binary  number  system 
(which  is  commonly  used  in  computers  and  digital  communications) . In  the  decimal  number  system, 
large  numbers  are  represented  by  digits.  Each  decimal  digit  represents  a power  of  ten.  In  a four- 
digit decimal  number,  the  most  significant  digit  represents  the  number  of  thousands  contained  in  the 
number.  There  are  ten  possible  values  W for  the  number  of  thousands  in  any  number  - 0,  1,  2,  3, 
4,  5,  6,  7,  8,  or  9.  The  next  most  significant  digit  represents  one  power  of  ten  less  (hundreds).  This 
digit  is  the  number  of  hundreds  left  in  the  number  after  all  of  the  thousands  have  been  accounted  for. 
The  third  most  significant  digit  represents  two  powers  of  ten  less  (tens).  The  least  significant  digit 


Figure  13-12.  Decimal  versus  binary  number  systems. 


DECIMAL  NUMBERS 
T 

H H 
O U 
U N 
S D 

A R T O 
N E E N 
DONE 
S S S S 

W X Y Z 


BINARY  NUMBERS 


E 

I F 

GOTO 
H U W N 
T R O E 
S S S S 

W X Y Z 


"W”  = 0,  1,  2,  3,  4,  5,  6,  7,  8,  or  9 


”W"  = 0 or  1 


WXYX  - W X 1000 
+ X X 100 
+ Y X 10 
+ Z X 1 


WXYZ  - W X 8 
+ X X 4 
+ Y X 2 
+ Z X 1 


406 


represents  the  ones.  A four-digit  decimal  number  can  represent  any  decimal  number  between  0 and 
9999.  Negative  numbers  can  be  accommodated  by  adding  a sign.  Fractions  can  be  accommodated 
by  adding  a “decimal  point”  and  digits  which  represent  tenths,  hundredths,  thousandths,  etc. 
Decimal  numbers  of  arbitrary  size  can  be  generated  by  adding  more  digits. 

Binary  numbers  are  based  on  powers  of  two  (as  opposed  to  ten).  The  binary  digits  (usually 
contracted  to  “bits”)  represent  the  number  of  ones,  twos  (=2'),  fours  (=2^),  eights  (=2^),  etc.  in  the 
number.  There  are  only  two  values  possible  in  each  bit:  0 or  1 . Binary  numbers  of  arbitrary  size 
can  be  constructed  by  adding  more  bits  representing  still  higher  powers  of  two.  Negative  numbers 
and  fractions  can  be  handled  analogous  to  decimal  numbers. 

Now  consider  Figure  13-13.  A modulating  signal  is  sampled  (at  regular  intervals  in  this 
figure,  but  irregular  sampling  is  also  allowable)  and  the  value  at  each  sample  is  assigned  to  the 
nearest  digital  signal  level.  At  sample  1,  the  nearest  level  is  level  4.  At  sample  2,  the  nearest  level 
is  level  6.  The  digital  signal  levels  are  usually  equally  spaced  (on  either  a linear  or  a logarithmic 
scale)  and  bear  no  immediately  obvious  relationship  to  the  amplitude  of  the  modulation  signal.  For 
example,  the  signal 


Figure  13-13.  Digital  encoding. 
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may  vary  between  -10  and  -100  volts  or  between  -10  and  +10  volts  or  between  -10  and  +10 
microvolts,  yet  the  same  digital  levels  0, 1,2,...  ISmaybeused.  This  is  why  digitization  of  a signal 
is  sometimes  called  encoding  analogous  to  the  cryptographic  practice  of  assigning  some  arbitrary 
symbol  or  group  of  letters  and  numbers  to  represent  a specific  letter  or  number.  Here,  each  real 
signal  level  is  assigned  to  a specific  digital  level  (or  number). 

Digitization  is  the  first  step  in  pulse  code  modulation.  The  next  step  is  to  convert  the  digital 
signal  level  into  a number,  usually  binary.  In  the  example  above,  possible  digital  levels  range  from 
0 to  8.  This  range  can  be  spanned  by  four-bit  binary  numbers.  The  first  value  4 (sample  number 
one)  becomes  the  binary  number  0100.  The  second  value  6 (sample  number  two)  becomes  the 
binary  number  0110  and  so  on.  Now  the  binary  numbers  are  translated  into  pulses  which  form  the 
pulse  code  modulation.  Binary  numbers  are  usually  used  because  the  translation  of  each  bit  requires 
selection  between  only  two  possible  waveforms.  One  form  of  coding  might  be  to  represent  each  bit 
by  a possible  pulse.  If  the  pulse  is  present  in  a designated  time  slot,  the  bit  is  assumed  to  be  a “1"; 
if  the  pulse  is  absent,  then  the  bit  is  assumed  to  be  a “0".  This  is  called  on-off  keying  (OOK).  In 
on-off  keying,  the  number  four  (0100)  is  represented  as  NoPulse  - Pulse  - NoPulse  - NoPulse. 

Figure  13-14  illustrates  a number  of  the  keying  formats  that  can  be  used  with  pulse  code 
modulation.  In  usual  practice,  each  bit  is  assigned  a specific  time  slot.  Within  that  time  slot,  the 


Figure  13-14.  Examples  of  keying  (bit  encoding)  formats 
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keying  technique  modulates  a pulse  or  a pulse  train  in  a fashion  that  the  receiver  can  demodulate  and 
recover  the  meaning  of  the  original  bit  (i.e.,  0 or  1).  The  formats  shown  in  Figure  13-14  are  not  all- 
inclusive.  For  example,  polarization  shift  keying  (e.g.,  up  linear  polarization  = 0,  sideways  linear 
polarization  = 1)  is  clearly  possible  but  not  explicitly  shown.  Not  all  keying  formats  can  be  applied 
to  all  types  of  signals.  For  example,  polarity  shift  keying  is  possible  when  the  transmission  system 
involves  voltages  transmitted  over  a wire,  but  is  meaningless  when  applied  to  electromagnetic 
radiation  transmitted  through  space.  Similarly  polarization  shift  keying  is  applicable  to 
electromagnetic  radiation  but  not  to  voltages  on  a wire.  Some  keying  techniques  such  as  OOK  are 
universally  applicable. 

Not  all  keying  formats  are  equal  when  it  comes  to  robustness  in  the  presence  of  receiver 
noise  or  transmission  anomalies.  Let  us  compare  two  commonly  used  techniques,  OOK  and  pulse- 
position  modulation  (PPM).  In  OOK,  the  optimum  demodulator  will  examine  the  signal  strength 
S in  the  bit  interval  (assumed  to  be  time  = 0 to  time  = 7) 

= (13.27) 


and  try  to  determine  whether  it  is  closer  to  “zero”  or  some  “non-zero”  value.  The  received  “signal” 
s(t)  has  contributions  from  the  true  signal  y(t)  and  from  noise  n(t) 

s[t)  = y(^t)  + n[t)  (13.28) 

Since  the  system  will  always  have  receiver  noise,  then  S>0,  even  if  y=0.  This  bears  a strong 
resemblance  to  the  radar  detection  problem  analyzed  in  Chapter  1 1 . The  optimum  decision  criterion 
for  OOK  is  a threshold  test: 


> 

Choose  bit  = 1, 

(13.29a) 

Sr 

< 

Choose  bit  = 0, 

(13.29b) 

where  Sj-  is  chosen  to  minimize  the  probability  of  error. 

The  bit  error  probability  Pr(e)  is  the  sum  of  the  probability  that  a 1 is  falsely  called  a 0 and 
the  probability  that  a 0 is  falsely  called  a 1 

Pr(e)  = P(0|l)+P(l|0).  (13.30) 

Evaluation  of  the  bit  error  probability  will  be  left  to  a later  volume  when  communication  systems 
are  discussed.  At  this  point  we  will  only  say  that  the  threshold  is  usually  adjusted  to  equalize  F*(0|  1) 
and  F*(1|0).  This  will  produce  bit  error  probabilities  that  are  almost  if  not  absolutely  minimum. 
Nevertheless  it  should  be  obvious  that  if  sensor  noise  goes  up  significantly,  then  the  bit  error  rate 
will  also  go  up  because  P(1 10)  will  increase.  Similarly,  if  there  is  significant  fading  of  the  signal. 
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then  the  bit  error  rate  will  increase  because  P(0|1)  will  increase. 

Now  consider  pulse  position  modulation.  A pulse  is  always  present  in  one  or  the  other  half 
of  the  bit  time.  If  we  associate  a pulse  being  present  in  the  first  half  of  the  bit  time  with  a 0 and  a 
pulse  being  present  in  the  second  half  of  the  bit  time  with  a 1 then  we  may  define  two  quantities 

5*0=]^  dt  s{t)  (13.31) 

and 

S,=\ljts(t).  (13.32) 

In  the  absence  of  noise,  a logical  0 should  produce  a large  Sq  and  a zero  S,,  while  a logical  1 should 
produce  a large  S;  and  a zero  Sg.  The  presence  of  noise  will  cause  the  “zero”  values  to  assume  non- 
zero levels.  The  presence  of  fading  will  tend  to  affect  both  Sg  and  S,  more  or  less  equally.  The 
optimum  decision  criterion  in  this  instance  is  a ratio  test 


> 

Choose  bit  = 0, 

(13.33a) 

< 

Choose  bit  = 1 . 

(13.33b) 

Using  this  test,  errors  only  occur  if  noise  becomes  comparable  to  the  signal  level  and  if  the  fades  are 
so  deep  as  to  make  the  noise  significant.  Any  keying  technique  that  uses  a ratio  test  will  have 
substantially  lower  bit  error  rates  at  a given  carrier-to-noise  ratio  than  any  technique  that  uses  a 
threshold  test.  Similarly,  any  technique  that  uses  a ratio  test  will  be  less  susceptible  to  bit  errors 
produced  by  fading  than  any  technique  that  uses  a threshold  test.  Although  OOK  is  the  simplest 
pulse  code  keying  scheme,  it  is  seldom  used  because  of  noise  and  fading  problems.  PPM,  PSK,  or 
FSK  all  provide  superior  performance  because  it  is  possible  to  use  ratio  tests  in  the  decision  process. 

Delta  modulation  is  an  important  form  of  PCM.  Figure  13-15  shows  the  modulator  and 
demodulator  portions  of  a delta  modulation  transmission  system.  The  output  A(t)  of  a difference 
amplifier  is  modulated  by  a simple  pulse  train  p^{f).  The  output  is  a train  of  pulses  of  constant 
amplitude  but  varying  polarity.  The  sign  of  the  output  polarity  depends  only  on  the  sign  of  the  input 
function  A(t).  Note:  in  practice  any  two  discernible  parameters  (polarization,  frequency,  pulse- 
position,  etc.)  could  be  used  to  transmit  the  sign  of  the  difference.  The  output  pulse  train  is 
simultaneously  transmitted  over  the  communication  channel  to  the  receiver  and  directed  into  an 
integrator.  The  output  of  the  integrator  q{i)  is  subtracted  from  the  input  modulation  m(t)  to  generate 
the  difference  signal  A(t).  Thus, 

= (13.34) 

where  Sgn(x)  is  the  signum  or  sign  function  and 
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(13.35) 


A(?)  = m{t)  - q{t)  = m{t)  ~^dt  pX^)  ■ 

The  name  delta  modulation  derives  from  the  fact  that  the  transmitted  signal  is  proportional  to  the 
difference  between  the  modulation  signal  m{f)  and  an  approximation  to  the  modulation  q{f).  Figure 
13-16  shows  an  example  of  a modulation  waveform,  its  delta  function  approximation,  and  the 
transmitted  pulse  train. 

At  the  receiver  end  of  the  system  a quantizer  analyzes  the  signal  and  determines  whether 
each  pulse  is  positive  or  negative  and  generates  a pulse  train  that  reproduces  the  original pjj)-  The 
quantizer  serves  to  minimize  the  effects  of  noise  and  channel  propagation  characteristics  on  the 
signal.  An  integrator  then  adds  (or  subtracts)  the  output  pulse  train.  This  signal  should  be  the  same 
as  the  approximate  modulation  q{f).  A fdter  removes  the  high  frequency  components  associated 
with  the  pulse  modulation  yielding  a smoothed  approximation  m ’{f)  to  the  original  modulation 
signal. 


Figure  13-15.  A delta  modulation  transmission  and  reception  system. 
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Figure  13-16.  The  waveforms  associated  with  delta  modulation. 
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Problems 


13-1.  A modulated  signal  is  investigated  with  a spectrum  analyzer.  The  spectrum  consists  of  a 
strong  spike  surrounded  by  two  equal  and  symmetric  sidebands  of  much  lower  amplitude. 
What  form  of  modulation  is  most  likely  to  be  responsible  for  the  observed  spectrum? 

13-2.  A modulated  signal  is  investigated  with  a spectrum  analyzer.  The  spectrum  consists  of  a 
series  of  equally  spaced  lines  of  widely  varying  amplitude.  What  form  of  modulation  is  most 
likely  to  be  responsible  for  this  observed  spectrum? 

13-3.  Describe  in  simple  mathematical  terms  the  difference  between  frequency  modulation  and 
phase  modulation. 

13-4.  Edwin  Armstrong  ’ s introduction  of  frequency  modulation  led  to  maj  or  improvements  in  the 
quality  of  radio  transmissions  compared  to  de  Forest’s  amplitude  modulation.  Why? 

13-5.  A binary  sequence  1001101011100011  is  to  be  transmitted  by  a pulse  code  modulation 
system.  Assume  a 1 MHz  bit  transmission  rate  with  no  dead  time  between  bits  and  1 psec 
bit  time.  Draw  with  appropriate  scale  the  transmitted  waveform  corresponding  to  this  binary 
sequence  if  on-off  keying  is  used.  On  the  same  drawing  draw  the  transmitted  waveform  if 
pulse-position  modulation  is  used. 

13-6.  An  atmospheric  propagation  channel  is  subject  to  multipath  fading.  Which  PCM  technique 
will  give  better  performance,  FSK  or  OOK?  Why? 
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CHAPTER  14 


IMAGING  AND  IMAGE-BASED  PERCEPTION 


Generalized  Images 

Nearly  everyone  has  a firm  idea  of  what  constitutes  an  image.  Most  would  describe  an 
image  as  a two-dimensional  array  (horizontal  by  vertical  angular  dimensions)  of  brightness,  and 
possibly  color,  data  that  is  sensed  by  the  eye  and  interpreted  by  the  brain.  This  is  indeed  a form  of 
imagery.  A few  other  individuals  would  use  the  same  physical  description  but  would  include 
acquisition  by  sensors  such  as  film,  television,  or  possibly  even  infrared  imagers.  In  fact,  all  of  these 
are  images,  but  the  aggregate  does  not  approach  a more  generalized  concept  of  images.  The  author 
prefers  the  more  general  definition  of  an  image  as 

“a  distribution  in  at  least  two  spatial  dimensions  of  one  or  more  parameters  related  to 
physical  properties  of  an  object  or  scene.  ” 

In  this  definition  there  are  many  possible  spatial  dimensions  that  can  be  used.  There  are  also  many 
different  physical  properties  that  can  be  distributed  over  those  dimension.  T able  14-1  lists  a number 
of  these  potential  spatial  dimensions  and  a number  of  possible  physical  properties. 


Table  14-1.  Potential  object  characteristics  that  can  form  images. 

POTENTIAL  SPATIAL  DIMENSIONS  POTENTIAL  PHYSICAL  PROPERTIES 


AZIMUTH  (or  BEARING) 

ELEVATION  ANGLE 
RANGE 

CARTESIAN  COORDINATES  (x,  y,  z) 
DEPTH  (or  ALTITUDE) 

MAP  COORDINATES 
CROSS-RANGE 


COLOR 

REFLECTIVITY 

REFLECTED  INTENSITY 

RADIANCE 

CONCENTRATION 

TRANSMITTANCE  (or  ABSORPTANCE) 

VELOCITY 

TEMPERATURE 

RANGE 

RADAR  CROSS  SECTION 


For  example,  a classic  visual  image  is  a distribution  of  reflected  intensity  over  azimuth  and 
elevation  angles.  However,  a distribution  of  radar  cross  section  as  a function  of  range  and  cross- 
range also  forms  an  image.  Such  an  image  could  be  produced  by  a synthetic  aperture  radar  (SAR) 
or  by  a range-Doppler  imaging  radar.  A distribution  of  infrared  radiance  versus  Cartesian 
coordinates  (x  = along  track  distance,  y = cross-track  distance)  is  another  form  of  image  that  could 
be  produced  by  a line  scan  thermal  reconnaissance  camera.  An  AN/KAS-1  sensor  essentially 
produces  an  image  of  path-integrated  infrared  absorptance  (of  chemical  vapors  in  the  air)  as  a 
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function  of  azimuth  and  elevation  angle.  In  essence,  any  combination  of  coordinates  and  physical 
parameters  can  form  an  image. 

An  image  may  have  more  than  one  physical  parameter.  An  active-passive  infrared  imager 
[1]  can  produce  a distribution  of  range,  velocity,  reflectivity,  and  infrared  radiance  as  a function  of 
azimuth  and  elevation  angle.  Obviously,  such  an  “image”  is  not  readily  understandable  by  human 
visual  processing  - human  vision  is  really  only  capable  of  processing  roughly  four  dimensions  of 
data  (horizontal,  vertical,  brightness,  and  color)  not  six.  However,  every  sub-image  (one  parameter 
versus  the  spatial  dimensions)  can  be  readily  viewed  and  understood. 
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Resolution  versus  Coverage 

Images  contain  information.  Whether  the  image  is  analog  or  digital,  the  spatial  context  is 
ultimately  essentially  quantized.  We  use  the  term  pixel  (a  contraction  of  “picture  element”)  to 
denote  any  one  of  the  discretely-addressable  regions  in  the  image  space  from  which  an  image  is 
constructed  by  assignment  of  parameter  values.  Pixels  may  represent  actual  data  samples  or  they 
may  be  discrete  elements  of  the  display  (such  as  individual  cells  in  a gas  discharge  (plasma  panel) 
display  or  individual  RGB  (red-green-blue)  phosphor  clusters  in  a color  TV  monitor). 

Resel  is  a contraction  of  “resolution  element”.  A resel  is  any  region  of  image  space  whose 
size  in  each  dimension  is  equal  to  the  system  resolution  in  that  dimension.  Resolution  may  be 
limited  by  external  factors  (e.g.,  atmospheric  turbulence,  microwave  multipath,  etc.),  the  measure- 
ment physics  (e.g.,  pulse  duration,  beamwidth,  collection  aperture,  etc.),  data  sampling  limitations 
(e.g.,  number  of  detector  elements,  digitization  rate,  etc.),  or  the  display  (e.g.,  electron  beam  spot 
size  greater  than  the  line  spacing  in  a television  display,  etc.). 

An  image  may  have  pixels  which  are  smaller  than  its  resels.  However,  it  is  arguable  as  to 
whether  it  is  really  possible  to  have  resels  that  are  smaller  than  the  pixels.  One  group  will  argue  that 
the  finite  spacing  of  pixels  effectively  prohibits  the  acquisition  of  information  with  a scale  smaller 
than  the  pixel  spacing.  This  can  be  considered  to  limit  the  resolution.  If  one  sample  is  all  that  is 
obtained,  the  author  will  agree  with  this  first  group.  A second  group  argues  that  if  additional  looks 
are  obtained  with  sub-pixel  offsets  relative  to  the  first,  then  the  aggregate  of  multiple  looks  will  yield 
smaller  scale  information  down  to  the  limit  of  the  resel.  The  author  agrees  with  the  assertion  but 
would  suggest  that  the  aggregated  data  has  a reduced  pixel  size.  Thus,  it  is  likely  that  the  pixel  size 
must  be  smaller  than  or  equal  to  the  resel  size  in  any  system.  There  are  two  common  units  of 
resolution  that  need  explanation.  The  first  is  TV  line  (TVL).  A TVL  is  the  spacing  between 
resolvable,  fully-interlaced  raster  lines  that  equals  the  projected  size  of  a resolution  element.  That 
is. 


1 TVL  = 1 resel  (14.1) 

The  second  unit  is  the  line  pair  (LP).  A line  pair  is  the  width  of  a light  bar-dark  bar  pair  in  a 
resolution  chart  that  would  exhibit  a depth  of  modulation  of  only  50%  when  placed  near  the  target 
and  observed  by  the  system.  One  line  pair  is  the  wavelength  of  one  cycle  of  a light-dark  square 
wave.  It  is  often  used  interchangeably  with  the  unit  called  a cycle.  That  is, 

1 LP  = 1 cycle  = 2 resels  = 2 TVL  (14.2) 

These  units  are  usually  combined  with  a unit  length  or  unit  angle,  such  as  LP/mm,  cycles/mrad, 
TVL/inch  when  describing  the  resolution  of  a system. 

When  dealing  with  images  and  imaging  systems,  there  are  several  terms  that  must  be 
understood.  These  are  field  of  regard,  field  of  view,  and  instantaneous  field  of  view.  The 
relationship  of  these  terms  to  each  other  is  illustrated  in  Figure  14-1.  Images  are  typically  generated 
by  scanning  a detector  array  over  a limited  range  of  angles.  The  field  of  view  (FOV)  is  defined  by 
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the  range  of  angles  that  is  scanned  and/or  subtended  by  the  detector  array.  Each  frame  of  image  data 
covers  one  field  of  view.  At  any  given  instant  of  time,  the  detector  elements  each  look  at  a subset 
of  the  field  of  view.  The  instantaneous  field  of  view  (IFOV)  is  usually  defined  as  the  angular 
portion  of  the  field  of  view  that  can  be  seen  by  one  detector  at  any  instant  in  time.  Occasionally,  the 
IFOV  is  defined  as  that  subset  of  the  field  of  view  that  is  viewed  by  the  entire  detector  array  at  any 
one  instant.  Context  usually  removes  any  ambiguity  in  the  assignment.  Field  of  regard  (FOR)  is 
the  range  of  angles  that  can  be  imaged  if  the  FOV  is  moved  around  by  a pointing  system. 

The  IFOV  can  be  determined  by  the  dividing  the  detector  dimension  d by  the  effective  focal 
length  (image  focal  length / times  any  afocal  magnification  M) 

e,For=dl  fM  = 0„l  M . (14.3) 

The  FOV  is  determined  by  the  limits  of  the  scanner  motion,  modified  by  any  afocal  magnification 
between  the  scanner  and  the  output  aperture,  plus  one  IFOV, 

^FOV  ^ (^5  / ^)  + OjpQy  X 0^  I M . (14.4) 


Figure  14-1.  Graphic  representation  of  image  terms  of  reference. 
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In  many  instances,  where  0^  is  large,  then  OipQy  ean  be  negleeted.  The  FOR  is  determined  by  the 
limits  of  full  aperture  pointer  motion  plus  one  FOV, 

0,0,  =S,  + 0,0V  ■ (>'*■5) 

The  preeeding  equations  assume  that  the  optical  system  does  not  have  any  field  stops  that  would 
artificially  limit  the  “field  of  view”  to  smaller  values. 

In  almost  every  imaging  system  there  is  a tradeoff  between  FOV  and  resolution.  For 
example,  a modem  digital  eamera  may  have  a focal  plane  array  with  1 024  x 1 024  detector  elements. 
If  this  array  takes  an  image  with  resolution  a (in  prad),  then  the  FOV  of  the  system  can  be  no  larger 
than  1024a.  Projected  to  the  ground  at  range  R,  the  resolution  covers  linear  dimension  (ealled  the 
ground  sample  distance  - GSD)  I = aR  and  the  FOV  eovers  linear  dimension  L=N  aR,  where  N 
is  the  number  of  detectors  in  one  dimension  of  the  detector  array  (=  1024  in  the  example  above). 
If  N =1024,  a = 10  jurad,  and  R = 100  km,  then  I =1  m and  L = 1 km.  If  higher  resolution  is 
required,  for  example,  a = 1 fjrad,  then  I = 10  cm,  but  L = 100  m.  For  a fixed  number  of  detector 
elements  in  a staring  array,  resolution  and  FOV  are  proportional  quantities.  Their  ratio  is  fixed.  If 
one  is  increased,  the  other  must  inerease  by  the  same  faetor.  This  is  illustrated  in  Figure  14-2.  Four 
images  of  aireraft  on  an  airfield  are  presented  with  ground  sample  distanees  of  48  inches,  24  inehes, 
12  inches,  and  6 inches,  respectively.  The  FOV  is  concomitantly  reduced  by  a faetor  2 between  eaeh 
image  to  maintain  the  fixed  ratio. 

The  proportionality  between  FOV  and  resolution  is  not  just  valid  for  staring  array  detectors. 
Consider  a imager  using  a seanned  linear  deteetor  array.  Any  detector  element  must  dwell  on  any 
pixel  in  the  seene  for  a minimum  period  of  time  in  order  to  aequire  suffieient  target  radiation  to  be 
accumulated.  The  quotient  of  the  number  of  deteetors  in  the  array  divided  by  the  dwell  time  gives 
the  maximum  number  of  pixels  per  unit  time  that  may  be  acquired.  The  area  eoverage  per  unit  time 
will  be  the  product  of  the  pixel  area  and  the  maximum  number  of  pixels  per  unit  time  If  high 
resolution  pixels  are  obtained,  then  the  area  eorresponding  to  eaeh  pixel  will  be  small,  and  the  area 
coverage  will  be  small.  If  low  resolution  pixels  are  obtained,  then  the  pixel  area  will  be  larger,  and 
the  area  eoverage  will  be  larger. 

Even  photographie  film  is  subjeet  to  a finite  product  between  resolution  and  area  eoverage. 
The  resolution  of  film  is  determined  by  the  size  of  the  silver  halide  grains  in  its  emulsion.  Eaeh  film 
type  will  have  a specified  resolution  (usually  specified  in  line  pairs/mm).  When  exposed  by  a high 
quality  optical  system,  the  resolution  will  be  determined  by  the  film  resolution.  To  obtain  higher 
spatial  resolution,  the  image  size  at  the  film  must  be  inereased.  Sinee  film  only  eomes  in  fixed  sizes, 
inereased  image  size  must  eome  at  the  expense  of  redueed  field  of  view. 

Thus,  it  is  generally  not  possible  to  design  an  imaging  system  with  arbitrarily  large  FOV  and 
arbitrarily  small  resolution.  For  this  reason  many  imaging  systems  are  provided  with  switchable 
opties  that  can  several  different  fields  of  view  at  proportional  resolutions.  Initial  search  may  be  done 
with  a wide  FOV  (perhaps  250  mrad  at  0.25  mrad  resolution)  while  target  recognition  is  done  with 
a narrow  field  of  view  (perhaps  25  mrad  at  25  prad  resolution). 
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Figure  14-2.  Tradeoff  between  coverage  and  resolution.  [2] 


48-Inch  Ground  Sample  Distance 


24-Inch  Ground  Sample  Distance 
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Figure  14-2  (continued).  Tradeoff  between  coverage  and  resolution.  [2] 


12-Inch  Ground  Sample  Distance 


6-Inch  Ground  Sample  Distance 
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Image-Based  Perception 


Visual  images  are  major  source  of  information  used  by  human  beings  to  perform  just  about 
any  function.  We  can  use  our  vision  to  detect  objects,  to  recognize  specific  objects,  and  to 
determine  the  probable  class  of  unknown  objects.  The  term  "recognition"  is  somewhat  loosely 
defined  in  common  usage.  It  is  sometimes  used  to  denote  the  entire  class  of  human  perceptual 
processes  that  lay  beyond  target  "detection".  In  other  instances  it  carries  a quite  specific  meaning 
and  refers  to  a specific  level  of  perceptual  performance.  Table  14-2  lists  one  hierarchical  ordering 
of  perceptual  processes  and  their  definitions. [3]  Neither  this  hierarchy  nor  the  definitions  are 
universally  accepted.  However,  they  are  in  sufficient  agreement  with  a large  body  of  other  sensor 
designers  that  the  author  sees  no  reason  to  alter  it.  For  the  remainder  of  this  book,  we  will  use  the 
narrow  definitions  as  shown  in  Table  14-2,  unless  it  is  specifically  indicated  that  a broader  definition 
is  being  used.. 

In  an  earlier  paper,  the  author  has  related  this  hierarchy  to  increasing  amounts  of  information 
available  to  the  observer,  although  he  has  not  proven  this  relationship.  Although  unproven,  the 
concept  is  intellectually  satisfying  and  of  demonstrated  utility.  We  will  continue  to  make  this 
association  despite  its  lack  of  proof.  The  hierarchy  begins  with  detection  - meaning  simply  that 
something  is  present  in  the  image  and  we  can  perceive  its  presence,  but  no  other  information  is 
extractable.  If  slightly  more  information  is  available,  the  observer  may  be  able  to  discern  a level  of 
symmetry  in  the  object  (or  lack  thereof)  and  a rough  orientation  (horizontal  or  vertical).  With  more 


Table  14-2.  Levels  of  image-based  object  perception. 


PERCEPTUAL 

LEVEL 

MEANING 

EXAMPLE 

DETECTION 

AN  OBJECT  IS  PRESENT. 

OBJECT 

ORIENTATION 

THE  OBJECT  IS  APPROXIMATELY  SYMMETRIC  OR 
ASYMMETRIC  AND  ITS  ROUGH  ORIENTATION  MAY 
BE  DISCERNED. 

HORIZONTAL 

RECTANGLE 

CLUTTER 

REJECTION 

THE  OBJECT  IS  A POTENTIAL  TARGET  AND  NOT  A 
CLUTTER  OBJECT. 

TARGET 

CLASSIFICATION 

THE  BROAD  CLASS  OF  OBJECT  TYPES  TO  WHICH 
THE  OBJECT  BELONGS  MAY  BE  DETERMINED. 

TRACKED 

VEHICLE 

RECOGNITION 

THE  SUBCLASS  OF  OBJECT  TYPES  TO  WHICH  THE 
OBJECT  BELONGS  MAY  BE  DETERMINED. 

TANK 

IDENTIFICATION 

FRIEND-OR-FOE 

THE  -COUNTRY  OF  MANUFACTURE’  OF  THE  OBJECT 
MAY  BE  DETERMINED. 

WARSAW  PACT 
TANK 

IDENTIFICATION 

THE  SPECIFIC  OBJECT  TYPE  WITHIN  A CLASS  OF 
OBJECTS  MAY  BE  DETERMINED. 

T-72  TANK 

DISCRIMINATION 

REAL  TARGETS  MAY  BE  DISTINGUISHED  FROM 
REPLICA  DECOYS. 

REAL  T-72 

INTENT 

DETERMINATION 

OWNERSHIP  OF  AN  OBJECT  MAY  BE  DETERMINED 
AND/OR  PROBABLE  HOSTILE/NON-HOSTILE  INTENT 
MAY  BE  ESTABLISHED. 

IRAQI  T-72 
(HOSTILE) 
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information,  the  observer  may  be  able  to  tell  that  the  object  is  not  likely  of  natural  origin  - it  is 
probably  man-made  and  therefore  a potential  target  and  not  a clutter  object.  At  a higher  level  of 
perception  we  can  determine  the  broad  class  of  objects  to  which  the  object  belongs.  As  we  progress 
up  the  hierarchy,  we  can  determine  the  subclass,  and  specific  type  of  object.  At  the  highest  level 
of  perception  we  can  discriminate  a real  object  from  a decoy  intended  to  deceive  the  observer  and 
possibly  even  determine  the  likely  intent  of  the  object  from  its  detailed  configuration  and  behavior. 

A governmental  entity  called  the  Automatic  Target  Recognizer  Working  Group  (ATRWG) 
published  a slightly  different  hierarchy  [4]  shown  in  Figure  14-3.  It  has  fewer  levels  but  is  in 
significant  agreement  with  the  trend  of  Table  14-2.  The  four  levels  which  equate  to  detection, 
classification,  recognition,  and  identification  in  Table  14-2  are  used  extensively  in  discussions  of 
imaging  sensor  performance.  The  perceptual  level  of  orientation  was  used  in  early  work  (described 
in  the  next  section),  but  is  seldom  used  today. 

How  we  perceive  and  “recognize”  objects  is  the  study  of  the  science  of  cognition.  We  have 
made  great  strides  in  our  understanding  how  the  visual  system  works  in  transforming  stimuli  into 
nerve  signals,  but  to  date,  we  still  do  not  have  even  the  simplest  workable  concept  for  describing 
how  we  recognize  objects.  Nevertheless,  we  have  been  able  to  perform  numerous  experiments  that 


Figure  14-3.  ATRWG  image-based  discrimination  hierarchy.  [4] 
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have  allowed  us  to  develop  empirical  relationships  for  predicting  perceptual  performance  and 
quantifying  aspects  of  the  visual  system  relevant  to  sensor  design.  The  direct  relationships  to 
perceptual  processes  are  described  in  the  next  section.  Before  doing  this  we  will  present  some  of 
the  results  that  have  influenced  that  work. 

Rosell  & Willson  [5]  found  in  empirical  studies  that  the  probability  of  image-based  detection 
by  human  observers  of  small,  rectangular,  normal  contrast  targets  was  a function  of  signal-to-noise 
ratio  as  shown  in  Figure  14-4.  A detection  probability  of  50%  requires  SNR  of  2.8.  Note:  in  this 
instance  the  author  means  SNR  not  CNR.  If  the  SNR  is  less  than  this,  the  target  to  be  recognized 
will  often  not  be  detected  in  the  first  place.  Electro-optical  systems  designed  to  have  human 
observers  will  typically  satisfy  this  criterion  at  their  design  range. 

Another  empirical  observation  is  that  human  observers  can  only  utilize  a finite  number  of 
gray  levels  (shades  of  gray)  in  images.  Figure  14-5  shows  a sequence  of  images  made  with 
increasing  numbers  of  gray  levels.  [6]  Somewhere  around  64- 128  gray  levels  image  quality  does  not 
continue  to  improve.  Thus,  designing  a system  to  provide  more  128  gray  levels  will  not  improve 
perceptual  performance  and  is  likely  to  be  overdesigned. 


Figure  14-4.  Rosell-Willson  rectangular  target  detection  probability. [5] 


IMAGE  SIGNAL-TO-NOISE  RATIO 
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Figure  14-5.  Comparison  of  images  displayed  using  different  numbers  of  gray  levels.  [6] 


32  Gray  Levels  1 28  Gray  Levels 
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Johnson’s  Criteria 


In  the  mid-1950's,  John  Johnson  at  the  U.  S.  Army  Engineer  Research  and  Development 
Laboratories  at  Fort  Belvoir  (at  last  account  now  called  the  Night  Vision  and  Electronic  Sensors 
Directorate)  performed  a series  of  psychophysical  experiments  on  perception  of  image  intensifier 
images.  [7]  Targets  were  drawn  from  nine  classes  were  placed  at  different  ranges  in  a benign  terrain 
context  and  were  viewed  with  an  image  intensifier  by  a large  number  of  observers.  Each  observer 
was  asked  what  he  saw  in  the  image: 

Was  an  object  present? 

If  yes,  what  did  you  see? 

If  yes,  what  was  the  object? 

At  each  range,  4-bar  test  patterns  (similar  to  that  in  Figure  1 2-3)  were  also  imaged  and  the  minimum 
number  of  resolvable  lines  across  the  minimum  dimension  of  each  target  was  determined.  After 
collecting  a large  number  of  data,  Johnson  statistically  analyzed  them.  He  found  a strong  correlation 
between  how  much  could  said  about  the  target  and  the  minimum  number  of  resolvable  lines 
detected.  The  results  are  shown  in  Figure  14-6.  The  average  over  all  target  classes  was  also 
determined.  Statistically,  he  found  that  if  2.0  ±0.5  resolvable  lines  across  the  minimum  dimension 
could  be  perceived  then  the  observer  had  a 50%  probability  of  detecting  the  target.  If  8.0  ± 1.6 
resolvable  lines  could  be  distinguished  then  the  observer  had  a 50%  probability  of  recognizing  the 
target.  The  four  values 


2.0  resels  per  min.  dimension 

2.8  resels  per  min.  dimension 

8.0  resels  per  min.  dimension 

12.8  resels  per  min.  dimension 


50%  detection  probability 
50%  orientation  probability 
50%  recognition  probability 
50%  identification  probability 


have  become  known  as  Johnson’ s criteria.  These  values  were  observed  to  be  essentially  independent 
of  scene  signal-to-noise  ratio  and  target-to-background  contrast  as  long  as  the  scene  values  and  the 
resolution  chart  values  for  these  parameters  were  roughly  the  same.  Similar  studies  using  other 


Figure  14-6.  Johnson's  criteria. 


TARGET 

(BROADSIDE  VIEW) 

RESOLUTION  ELEMENTS  ACROSS  MINIMUM  DIMENSION  (AT  50%  PROB.)| 

DETECTION 

ORIENTATION 

RECOGNITION 

IDENTIFICATION 

TRUCK 
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16.0 

M-48  TANK 
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14.0 

STALIN  TANK 
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12.0 

CENTURION  TANK 

1.5 

2.4 

7.0 

12.0 

HALF-TRACK 

2.0 

3.0 

8.0 

10.0 

JEEP 
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9.0 

1 1.0 

COMMAND  CAR 

2.4 

3.0 

8.6 

1 1.0 

SOLDIER  (STANDING) 

3.0 

3.6 

7.6 

16.0 

105mm  HOWITZER 

2.0 

3.0 

9.6 

12.0 

AVERAGE  ± STD.  DEV. 

2.0  ± 0.5 

2.8  ± 0.7 

8.0  ± 1.6 

12.8  ± 3.0 
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Figure  14-7.  Resolution  dependence  of  Johnson's  criteria  probabilities. 
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target  sets  have  not  contradicted  Johnson’s  original  results. 

Johnson  also  determined  the  basic  resolution  versus  probability  behavior  of  the  detection, 
recognition,  and  identification  probabilities.  This  behavior  is  shown  in  Figure  14-6. [8]  A small 
table  inset  to  the  figure  summarizes  the  multiplier  (of  the  appropriate  Johnson’s  criterion)  to  convert 
to  probabilities  other  than  50%. 

These  criteria  or  their  most  recently  defined  values  [10]  are  used  as  heuristics  by  design 
engineers  in  estimating  required  aperture  sizes  and  are  utilized  by  most  modern  electro-optical 
performance  prediction  codes  [8- 1 1 ] as  the  principal  criteria  for  determining  detection,  recognition, 
and  identification  probabilities  as  functions  of  range.  They  have  been  assumed  to  be  applicable 
across  the  whole  range  of  electro-optical  sensor  systems.  Little  evidence  has  arisen  to  invalidate  this 
assumption. 

Johnson's  criteria  were  developed  from  studies  of  rasterless  image  intensifier  imagery.  Color 
(which  results  in  significant  added  information)  complicates  their  applicability  to  direct  view  optical 
systems.  In  systems  where  color  is  distorted  (such  as  produced  by  the  use  of  yellow  or  orange  haze 
filters)  or  at  lower  light  levels  (where  scotopic  vision  predominates)  they  are  usually  accepted  as 
valid.  They  are  commonly  accepted  as  valid  for  monochrome  television  and  thermal  imaging 
systems,  despite  the  presence  of  a raster  on  most  displays.  Limited  personal  experience  with 
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multiple-  frame-averaged  laser  radar  reflectivity  images  does  not  contradict  the  validity  of  Johnson's 
criteria,  if  the  higher  angular  resolution  of  the  laser  radar  (see  the  Appendix  for  a discussion  of 
angular  resolution  in  sensors)  is  properly  considered.  Speckled  images  such  as  single-frame  laser 
radar  reflectivity  images  and  synthetic  aperture  radar  images  are  much  less  useful  than 
speckle-averaged  images,  and  are  not  believed  to  obey  Johnson's  criteria,  probably  due  to  their  low 
SNR  (=1).  The  last  two  sections  of  this  chapter  will  elaborate  on  image  SNR  and  its  significance. 
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Applications  of  Johnson’s  Criteria 

The  simplest  and  most  common  use  of  Johnson’ s criteria  is  to  perform  back-of-the-envelope 
estimates  of  minimum  aperture  sizes  for  imaging  sensors.  If  we  combine  any  appropriate  angular 
resolution  expression,  such  as, 

a = XHD  (14.6) 

where  Xis  the  constant  in  the  resolution  expression  {X=  1 .220  for  the  Rayleigh  criterion  and  an  Airy 
function  response)  with  an  expression  for  the  linear  size  / of  an  object  of  angular  size  ^at  range  R 

I = OR  (14.7) 


and  with  the  appropriate  Johnson’s  criterion  N^q  (the  number  of  resolution  elements  across  the 
minimum  target  dimension  to  give  50%  probability  of  performing  a specified  perceptual  task- 
= 2 for  detection)  we  obtain 


R = -^  = -^ 

XAN,, 


(14.8) 


Equation  (14.8)  gives  the  maximum  range  at  which  a particular  Johnson’s  criterion  N;g  can  be 
satisfied  for  a target  of  minimum  dimension  / using  a particular  resolution  criterion  XA/D  for  a 
sensor  with  wavelength  A and  aperture  diameter  D.  This  expression  is  plotted  in  Figure  14-8  for  a 
number  of  sensors.  The  criterion  for  both  recognition  (N^g  =8)  and  identification  (N^g  = 12.8)  have 
been  used.  The  Rayleigh  resolution  criterion  was  used  for  all  sensors  and  the  target  dimension  was 
assumed  to  be  2.5  m. 

The  curves  in  Figure  14-8  represent  the  minimum  aperture  sizes  that  are  consistent  with 
being  able  to  perform  a given  perceptual  function  at  a specified  range.  These  estimates  will  be 
reasonably  accurate  if  the  aperture  is  the  limiting  factor  in  determining  the  total  sensor  resolution. 
If  the  number  or  size  of  the  elements  in  the  detector  array  or  the  bandwidth  of  a filter  in  the  video 
processing  electronics  is  the  limiting  factor,  then  Figure  14-8  is  not  applicable.  However,  in  a well- 
designed  system,  the  aperture  is  usually  the  limiting  factor. 

Examination  of  Figure  14-8  shows  a number  of  interesting  facts.  First,  is  that  8-12  pm 
thermal  imagers  cannot  perform  recognition  or  identification  at  ranges  greater  than  5 km  with 
reasonably  sized  apertures  (20  cm  or  less).  Nevertheless,  we  have  sent  such  sensors  into  combat 
with  weapons  of  longer  effective  range  than  5 km.  Such  cases  have  resulted  in  fratricide  (casualties 
produced  by  friendly  fire).  Thermal  imagers  operating  in  the  3-5  pm  perform  much  better  in  terms 
of  perceptual  performance.  If  recognition  performance  is  desired  at  ranges  of  20-40  km  (desirable 
ranges  for  air  combat)  then  the  imaging  sensor  must  operate  in  the  visible  or  near  infrared.  This  is 
the  primary  reason  that  television  cameras  are  still  used  in  weapons  systems  despite  limitations  on 
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Figure  14-8.  Aperture  size  requirements  for  different  sensors  and  imaging  functions. 


TARGET  RANGE  km 

RECOGNITION  ^ ^ IDENTIFICATION 


their  performance  in  adverse  weather  and  at  night.  A final  observation  is  that  laser  radars  operating 
at  10  pm  should  outperform  thermal  imagers  operating  at  10  pm  in  terms  of  recognition  perfor- 
mance. 


A significant  step  up  in  sophistication  in  the  application  of  Johnson’s  criteria  is  the  “Night 
Vision  Laboratory  Search  Model”.  [11]  This  is  a computer  model  of  the  entire  visual  search  process. 
The  search  field  of  regard  is  broken  into  n separate  sensor  fields  of  view.  Images  are  assumed  to 
be  obtained  for  each  field  of  view.  Each  field  of  view  image  is  viewed  for  an  arbitrary  period  of 
time  until  the  searcher  decides  to  view  the  next  image.  This  time  is  assumed  to  be  roughly  the  same 
for  each  field  of  view.  The  fields  of  view  may  or  may  not  be  revisited. 

If  an  infinite  amount  of  time  could  be  expended  on  searching  the  field  of  regard,  one  expects 
a certain  probability  of  detection,  that  is  given  by  the  expression 


(V/V5of 

1 + (V/V,„f 


(14.9) 
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where  E has  been  empirically  derived  to  be 


E = 2.1  + ().1(N  ! N,^).  (14.10) 

N is  the  number  of  resolvable  cycles  (one  cycle  equals  two  resets)  across  the  target  dimension  and 
is  the  number  of  cycles  across  the  target  for  50%  detection  probability  (essentially  Johnson’s 
criterion  for  detection).  Eq.  (14.10)  is  qualitatively  and  quantitatively  very  similar  to  the  curve  in 
Figure  14-7,  the  original  Johnson  criterion  result.  Equations  (14.9)  and  (14. 10)  are  plotted  in  Figure 
14-9. 

If  a finite  amount  of  time  T can  be  expended  on  searching  the  field  of  regard,  the  detection 
probability  is  less  than  the  infinite  time  probability  by  the  relation 

p(r)  = p.[l-e“”""]  (14.11) 

where  n is  the  number  of  fields  of  view  within  the  field  of  regard  and  r is  the  mean  time  to  detect 


Figure  14-9.  The  infinite  time  probability  function  used  in  the  NVL  Search  Model. 
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a target  within  a single  field  of  view  (if  a target  is  found).  This  time  (in  seconds)  is  given  by 


z = iAlp„.  (14.12) 

The  search  model  is  a static  model.  That  is,  it  assumes  that  there  is  no  target  motion.  Target  motion 
produces  much  higher  detection  probabilities  and  shorter  search  times  than  predicted  by  the  search 
model. 


In  current  versions  of  imaging  system  performance  codes,  Johnson’s  criteria  are  used  in 
slightly  modified  format.  The  modification  is  based  on  a recognition  that  pixels  lying  across  the 
maximum  dimension  of  the  target  also  contribute  to  detection,  recognition,  and  identification. 
Obviously,  knowledge  of  whether  a target  is  “square”  (minimum  and  maximum  dimensions  roughly 
equal)  or  “rectangular”  (minimum  dimension  much  smaller  that  the  maximum  dimension).  Using 
two-dimensional  criteria  requires  slightly  different  numbers. 

A critical  target  dimension  is  defined  by  the  geometric  mean  of  the  target’s  vertical  and 
horizontal  dimensions 

L^  = {L^L,-f- . (14.13) 


The  new  values  for  the  Johnson’s  criteria  are: [10] 


1 .5  resels  (0.75  cycles)  per  critical  dimension 

3.0  resels  (1 .5  cycles)  per  critical  dimension 

6.0  resels  (3.0  cycles)  per  critical  dimension 

12.0  resels  (6.0  cycles)  per  critical  dimension 


50%  detection  probability 
50%  classification  probability 
50%  recognition  probability 
50%  identification  probability 


The  most  sophisticated  application  of  Johnson’s  criteria  is  in  the  imaging  system  perfor- 
mance models  developed  by  the  “Night  Vision  Laboratory”.  Consider  the  model  called  FLIR92/ 
ACQUIRE  which  is  designed  for  thermal  imaging  systems  but  can  be  adapted  to  other  types  of 
sensors.  FLIR92  allows  the  sensor  characteristics  to  be  calculated  and  incorporated  into  a set  of 
“minimum  resolvable  temperature  difference”  (MRTD)  curves.  We  will  discuss  this  aspect  of  the 
calculation  in  more  detail  in  the  chapter  on  thermal  imaging  systems  later  in  this  volume.  At  this 
point  all  we  need  to  know  is  that  these  curves  describe  the  minimum  resolvable  temperature 
difference  between  target  and  background  as  a function  of  the  spatial  frequency  for  a hypothetical 
resolution  pattern  viewed  by  the  sensor.  The  ACQUIRE  portion  of  the  code  calculates  probability 
of  performing  a specific  perceptual  function  in  the  following  manner. 

The  target  is  characterized  by  horizontal  and  vertical  dimensions  and  L y and  a temperature 

difference  AT.  Atmospheric  extinction  (defined  by  extinction  coefficient  a)  is  assumed  to  reduce 
the  real  target  signature  to  an  apparent  value  at  range  R given  by 
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APPARENT  ~ e (14.14) 

The  angular  size  of  the  target  at  range  R is  defined  by  the  equations 

= R and  Oy  = Ly ! R (14.15) 

The  model  begins  at  a small  value  of  the  range.  At  this  range  AT^pp^j^j^j.  is  ealculated.  This  value 
of  AT  is  compared  to  the  MRTD  curves  to  determine  the  horizontal  and  vertical  spatial  frequencies 
(in  cycles/mrad)  that  match  the  apparent  temperature  difference.  At  the  same  range  we  calculate  the 
angular  sizes  of  the  target  using  Eq.  (14. 1 5).  The  products  of  the  spatial  frequencies  and  the  angular 
sizes  gives  the  number  of  resolvable  cycles  across  the  target  at  range  R 


= ylfv  fn^v  (14.16) 

The  number  of  resolvable  cycles  is  translated  into  a probability  using  the  search  model  Eqs.  (14.9) 
and  (14.10).  This  process  is  repeated  for  a number  of  different  range  values  until  a plot  of 
probability  of  “perceptual  function”  versus  target  range  can  be  plotted.  Figure  14-10  illustrates  the 
steps  ((1)  thru  (8))  in  this  iterative  process.  The  approach  can  be  adapted  to  model  any  imaging 
system. 


Figure  14-10.  The  algorithm  used  in  the  ACQUIRE  model  for  imaging  system  performance. 
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Although  it  is  not  an  application  of  Johnson’s  criteria,  and  Johnson’s  criteria  were  not  used 
in  its  formulation,  there  is  another  quantitative  measure  of  perceptual  performance  versus  resolution 
that  is  in  reasonable  agreement  with  Johnson’s  criteria  and  serves  the  validate  the  general  approach. 
This  measure  is  the  Image  Interpretability  Rating  Scale  (IIRS)  used  by  photo-interpreters.  The  scale 
and  its  data  requirements  are  set  forth  in  two  Air  Standardization  Coordination  Committee  standards 
[12]-[13]  and  summarized  in  Figure  14-11.  The  basic  IIRS  tasks  are  detect,  recognize,  identify,  and 
analyze.  The  first  three  are  essentially  identical  to  the  Johnson’s  criteria  tasks  with  the  same  name. 
Analysis  is  the  ability  to  determine  critical  target  parameters  to  sufficient  accuracy  to  support 
technical  intelligence  requirements.  Analysis  would  probably  lie  well  below  the  bottom  of  the  list 
in  Table  14-2. 

Although  it  is  difficult  to  make  a close  comparison,  examination  of  the  resolution  require- 
ments for  detection,  recognition,  and  identification  as  shown  in  Fig  14-11,  indicates  that  the  results 
are  not  in  obvious  disagreement  with  Johnson  ’ s criteria.  It  should  be  noted  that  IIRS  was  developed 
without  any  consideration  (or  probably  even  knowledge  of)  Johnson’s  criteria. 
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Information-Based  Johnson’s  Criteria 


The  material  in  this  section  is  conjecture.  Despite  significant  intellectual  appeal  and  apparent 
agreement  with  limited  predictions,  it  still  lacks  unequivocal  and  incontrovertible  substantiation. 
It  concerns  a methodology  which  permits  the  “Johnson’s  criteria’’  approach  to  perception  to  be 
extended  to  all  forms  of  image-based  perception  from  all  kinds  of  generalized  imaging  sensors  and 
possibly  to  perception  based  on  non-imaging  sources.  Based  on  the  author’s  personal  research,  it 
has  a philosophical  aspect  and  a practical  aspect.  The  philosophical  aspect  involves  the  author’s 
conviction  that  the  nearly  universal  applicability  of  Johnson’ s criteria  derives  from  a basis  in  funda- 
mental principles.  That  is,  that  the  metrics  used  in  Johnson’s  criteria  are  directly  relatable  to  total 
information  content  and  that  total  available  information  drives  perceptual  ability.  The  philosophical 
and  empirical  justifications  behind  the  conjectures  are  explained  in  detail  in  Reference  [3].  In  this 
section  we  will  confine  the  discussion  to  the  mechanics  of  using  the  information-based  Johnson’s 
criteria. 

The  first  step  is  the  determination  of  the  total  information  content  of  an  image.  The  total 
information  Hj.  content  may  contain  contributions  from  intensity  data  (the  individual  pixel  bright- 
nesses Hj  and  their  arrangement  in  space  if^),  the  context  in  which  the  image  is  taken  if color  (the 
variations  in  spectral  content  of  the  signal  in  each  pixel)  H^j,  range  (if  “three-dimensional”  image 
data  is  obtained)  polarization/depolarization  of  the  image  radiation  Hp,  and/or  velocity 
(associated  with  each  pixel)  Hy. 

Hj  — Hq  -\-  ifj  -f  H -\-  H jj  -f  H -f  H p -f  Hy  (14.17) 

Obviously,  not  every  image  contains  every  kind  of  potential  information. 

The  pixel  intensity  information  is  assumed  to  be  given  by  the  product  of  the  number  of 
pixels  on  the  target  N (background  pixels  contribute  to  context  information)  and  the  information 
content  per  pixel  Bp 

= NBj  (14.18) 

In  angle-angle  images  (human  vision,  television,  thermal  imagers,  etc.)  the  number  of  target  pixels 
is  approximately  given  by 

N = N^Ny  = (L^  ! a^R)(Ly!  ayR)  (14.19) 

where  R is  the  range  to  the  target.  Lx  and  Ly  are  the  transverse  dimensions  of  the  target,  and  and 
ay  are  the  horizontal  and  vertical  angular  resolutions  of  the  total  sensor  system.  In  range-cross  range 
images  (e.g.,  synthetic  aperture  radar  images)  the  number  of  target  pixels  is  approximately  given  by 

N = NxN^  = (Lx  / AX)(Z2  / ^R)  (14.20) 
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where  is  the  target  size  in  the  range  dimension,  AZ  is  the  cross-range  resolution,  and  Ai?  is  the 
range  resolution.  The  information  content  should  be  related  to  the  signal-to-noise  ratio  of  the  data. 
From  information  theory  we  expect  the  information  content  in  bits  to  be  given  by 

Bj  x\og2(l  + SNR)  (14.21) 

Ideally  Bj  could  be  as  large  or  as  small  as  the  SNR  dictated.  However,  there  are  practical  limits.  In 
order  for  Johnson’s  criteria  to  be  strictly  applicable,  the  SNR  must  exceed  Rosell’s  SNR  (=2.8).  A 
SNR  = 2.8  gives  Bj  ~2  bits.  Later  in  this  section  we  will  allow  for  lower  values  of  SNR  (such  as  SNR 
=1  for  speckled  laser  radar  images  or  synthetic  aperture  radar  images)  by  an  appropriate  modifica- 
tion of  Johnson’s  criteria.  At  the  upper  end,  the  intrinsic  variability  of  materials  will  provide  an 
effective  limit.  If  one  looks  at  a supposedly  uniform  surface,  there  will  be  small  variations  in 
intensity.  This  may  be  due  to  small  variations  in  surface  texture,  reflectivity,  or  the  cleanliness  of 
the  surface.  These  variations  cannot  provide  information  useful  for  target  detection  or  recognition 
because  two  otherwise  identical  objects  will  show  these  differences,  and  because  those  differences 
will  not  necessarily  be  constant  over  time.  The  surface  variability  limit  is  given  by 

Bj<\og^(jul2cj)  (14.22) 

where  /j.  is  the  mean  intensity  and  cris  the  standard  deviation  of  the  surface  variations.  For  surface 
variability  of  1%  to  3%  of  the  mean,  this  provides  an  upper  limit  of  4-5  bits. 

The  “shape”  information  derivable  from  the  arrangement  of  the  pixels  is  rather  limited.  It 
can  be  shown  to  be  approximately  given  by  the  number  of  ways  N pixels  can  be  arranged  as 
rectangles,  which  is  approximately 

» log,(V^.V,)  (14.23) 

Context  also  conveys  very  limited  information  content.  The  maximum  number  of  contextual 
situations  that  aid  in  the  detection  or  recognition  process  is  limited.  For  example,  being  in  the  sky, 
on  a road,  on  a lake,  or  in  a field  conveys  useful  meaning.  Ground  vehicles  are  seldom  found  in 
lakes  or  in  the  air.  Orientation  of  the  target  with  respect  to  the  terrain  is  also  of  interest.  However, 
whether  the  target  is  in  front  of  one  tree,  two  trees,  or  any  number  of  trees  conveys  the  same  amount 
of  useful  information  as  being  in  front  of  a forest.  In  practice,  contextual  information  is  expected 
to  convey  at  most 

bits . (14.24) 

Sensors  that  measure  information  in  multiple  wavelength  bands  can  provide  color 
information.  The  magnitude  of  this  information  is  provided  by 

H^  = N Np  B^  (14.25) 
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where  is  the  number  of  pixels,  Np  is  the  effective  number  of  “primary  colors”  that  are  detected, 
and  is  the  number  of  bits  of  information  content  per  primary  color.  There  is  a trend  towards  use 
of  multispectral  imagers.  In  such  devices,  images  may  be  obtained  at  dozens  if  not  hundreds  of 
separate  wavelengths.  The  number  of  wavelengths  measured  is  not  the  same  as  the  number  of 
primary  colors  available  in  the  data.  If  the  world  were  covered  with  standard  three-color 
camouflage,  there  would  be  only  three  meaningful  colors,  no  matter  how  many  spectral  bands  into 
which  a sensor  tried  to  partition  the  signals.  Any  observed  spectral  pattern  could  be  shown  to  be 
produced  by  a linear  combination  of  the  responses  to  the  three  primary  colors.  In  practice  the 
number  of  primary  colors  is  probably  of  the  order  of  10.  The  bits  per  color  will  be  subject  to  the 
same  intrinsic  variability  limits  as  intensity  information.  Thus,  5^  is  likely  4-5  bits  unless  limited 
by  low  SNR. 

Some  sensors  can  provide  range  information.  Absolute  range  information  is  to  be 
differentiated  from  relative  range  information.  Knowing  that  a target  is  at  250 1 meters  versus  2500 
meters  tells  us  very  little  that  is  different  from  knowing  that  the  target  is  at  a range  between  2400 
and  2600  meters.  However,  knowing  that  the  front  of  a vehicle  is  6.5  m less  range  than  the  rear  of 
the  vehicle  tells  a great  deal  about  the  target.  The  utility  of  absolute  range  information  is  in 
transforming  angular  information  into  linear  information  {X=aK).  The  information  provided  by 
absolute  range  is  related  to  the  number  of  range  bins  necessary  to  estimate  target  sizes  to  an 
accuracy  of  roughly  1 resel.  This  can  be  shown  to  be  equivalent  to 

^ AR  ~ ^ y)  • (14.26) 

Relative  range  can  yield  considerably  more  information.  If  5R  is  the  range  precision  of  the  sensor 
then  the  information  per  pixel  provided  by  breaking  a target  of  range  depth  into  range  bins  of  size 

SR  is 

Bpp^\og^(L,ISR).  (14.27) 

Velocity  information  is  much  the  same  as  range  information.  Gross  velocity  (body  motion) 
provides  only  limited  information  about  target  class.  For  example,  breaking  the  velocity  spectrum 
into  stationary,  walking  speed,  vehicle  speed,  helicopter  speed,  subsonic  air  vehicle  speed, 
supersonic  air  vehicle  speed,  hypersonic  speed,  and  orbital  velocity  coupled  with  direction  (towards, 
away,  or  transverse  to  the  sensor  line  of  sight)  provides  about  all  the  discrimination  information  that 
one  can  practically  use.  If  we  can  define  motional  velocity  regimes  then  the  gross  velocity 
information  content  is 


= log,  iV„  . (14.28) 

At  most  this  will  contribute  3-5  bits  of  information.  Details  about  internal  motions  of  a target  are 
possible  more  useful.  By  analogy  to  the  relative  range  information,  we  may  define  the  internal 
motional  information  content  per  pixel  to  be 
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SK»log,[(F„-F„J/<5F; 


(14.29) 


where  - F^j^  is  the  spread  in  velocities  across  the  target  and  SF  is  the  velocity  measurement 
precision.  The  total  internal  velocity  information  is 

Hy  = NBy  . (14.30) 


If  a system  is  capable  of  using  polarization,  then  the  bits  per  pixel  of  polarization 
information  is  obviously 


Bp  = 


0 


lxlog2(l  + .SA«) 
[2  x\og^{\  + SNR) 


(1  xmt  / 1 rev) 

(1  xmt  / 2 rev  or  2 xmt  / 1 rev) 
(2  xmt  / 2 rev) 


(14.31) 


depending  on  how  polarization  is  used  and  measured. 

The  total  information  available  from  an  image  is  the  sum  of  all  of  the  relevant  terms  above. 

That  is, 

^ IMAGE  = ^y[Bi  + N pBfj  + Bpp  +By+Bp\ 

+ H^+H(.+  + Hj^ 

The  utility  of  the  information-based  approach  comes  only  after  we  define  a new  set  of  Johnson’s 
criteria.  In  reference  [3]  the  author  derived  the  following  criteria 


Hdet  ~ 5 bits 

(14.33) 

H PEC  ~ 88  bits 

(14.34) 

H IP,  « 306  bits 

(14.35) 

as  the  amount  of  information  needed  to  perform  detection,  recognition,  and  identification  at  the  50% 
probability  level.  When  properly  applied,  these  information-based  criteria  predict  performance  that 
is  almost  identical  to  that  of  the  Johnson’s  criteria  based  solely  on  resolution.  The  information 
content  would  be  determined  and  compared  to  the  information-based  criteria.  T aking  the  square  root 
of  the  ratio  yields  a quantity  roughly  equivalent  to  the  ratio  NIN^g  used  in  the  search  model.  Making 
this  equivalence  explicit,  we  may  use  the  search  model  to  predict  the  infinite  time  probabilities. 
Thus, 
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(14.36) 


To  illustrate  the  use  of  the  information-based  approach,  let  us  first  compare  a thermal  imager 
and  a three-dimensional  imaging  laser  radar.  Both  systems  have  the  same  aperture  and  operate  at 
the  same  wavelength  and  must  produce  50%  recognition  at  some  range.  We  wish  to  evaluate  the 
relative  range  at  which  this  occurs.  For  the  thermal  imager  we  assume  5^  = 4 bits/pixel.  For  the 
laser  radar,  we  have  Bj  = 1 bit/pixel  and  = 5 bits  per  pixel  (an  easily  achievable  range  precision). 

We  will  assume  that  the  resolution  of  the  radar  is  0.73  times  the  resolution  of  the  thermal  imager 

(this  is  the  ratio  of  resolutions  at  the  Rayleigh  criterion).  The  thermal  imager  will  have  pixel- 
dependent  (terms  proportion  to  the  number  of  pixels)  information  content  of 

H^j  ! alR^)  (14.37) 

while  the  laser  radar  will  have  information  content 

/(0.73a (14.38) 


Since  equal  information  content  should  yield  equal  recognition  performance,  the  effective  range  of 
the  laser  radar  is  much  larger  than  that  of  the  thermal  imager 

« 1.68i?„  . (14.39) 

Another  example  involves  estimating  the  recognition  potential  of  a synthetic  aperture  radar 
(SAR),  which  is  not  amenable  to  treatment  using  the  ordinary  Johnson’s  criteria.  We  assume  a 
target  size  of  3 x 7 meters.  For  an  assumed  information  per  pixel  of5^=  1 bit/pixel,  the  information 
content  of  the  SAR  is 

/ AXAT?)  -f  16  bits  (14.40) 

The  16  bits  accounts  for  spatial  arrangement  and  context  information.  Using  the  88  bits  recognition 
criterion,  we  calculate  that  the  SAR  can  achieve  50%  recognition  if  the  resolution  product  obeys  the 
relation 

AXAi?<(7x3)/72  = 0.29  m"  . (14.41) 
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Example  - Atmospheric  Turbulence  Effects  on  Laser  Radar  Images 

As  discussed  in  Chapters  4 and  1 1 , turbulent  scintillation  causes  an  otherwise  non- fluctuating 
(glint)  target  to  have  amplitude  fluctuations  of  magnitude  [14] 

I,  = (14.42) 

where  x is  the  log-amplitude.  The  log-amplitude  is  described  by  a normal  (Gaussian)  distribution 
with  a mean  equal  to  minus  the  variance,  i.e., 


(14.43) 


Recalling  the  definition  of  the  saturation  signal-ro-noise  (Eq.  (11.17)) 


SNRs,r 


(14.44) 


we  can  easily  calculate  [14] 
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(14.45) 


This  function  is  plotted  in  Figure  14-12.  It  takes  on  an  infinite  value  at  zero  turbulence  level.  This 
is  expected,  as  the  target  is  otherwise  non-fluctuating  and  by  definition  has  a zero  variance. 
However,  as  the  scintillation  approaches  saturation,  the  SNR^^j.  plunges  to  extremely  low  values, 
reaching  as  low  as  3 x 1 0 at  saturation.  There  is  virtually  no  usable  signal  left  when  the  SNR  is  that 
low. 


As  discussed  in  Chapter  1 1,  the  presence  of  Rayleigh- fading  and  aperture-averaging  alters 
the  probability  distribution  of  fluctuations  on  a speckle  target  (Rayleigh- fading  target)  to 


(14.46) 
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Figure  14-12.  Glint-target  saturation  signal-to-noise  ratio  vs  turbulence  strength.[14] 


where  F is  a unit-mean,  exponentially  distributed  random  variable  with  distribution 

p^{V)  = e~^  (14.47) 

which  accounts  for  Rayleigh  fading  and  U is  a log-normal  variable  given  by 


Pu(U)  = 


(14.48) 


where  is  the  aperture-averaged  log-amplitude  variance.  In  the  absence  of  turbulence  = 0)  then 

mean  of  the  remaining  exponential  term  is  1 as  is  the  variance.  Thus,  the  saturation  signal-to-noise 
ratio  for  a Rayleigh-fading  target  without  turbulence  is 
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(14.49) 


^^^sSAT  ~ 1 

With  turbulence  included  it  can  be  shown  that  the  saturation  signal-to-noise  ratio  for  the  speckle 
target  is  [14] 


SNR 


sSAT 


(14.50) 


This  function  is  shown  in  Figure  14-13.  As  expected,  when  the  turbulence  log-amplitude  variance 
goes  to  zero,  Eq.  (14.50)  approaches  unity.  It  stays  near  unity  until  the  scintillation  log-amplitude 
variance  becomes  comparable  to  one.  Then,  the  saturation  SNR  falls  to  a value  less  than  2x10  '^  as 
the  scintillation  reaches  saturation. 

Signal-to-noise  ratios  much  less  than  one  are  difficult  enough  to  comprehend  when  dealing 
with  simple  signals  and  noise.  When  dealing  with  images  it  can  be  harder  to  comprehend  the  results. 
Additional  insight  can  be  obtained  by  studying  synthetic  images  degraded  by  high  noise  levels.  In 
Figure  14-14  a synthetic  glint  (non-fluctuating)  target  is  shown  against  a supposedly  uniform 
background  (the  reproduction  of  the  synthetic  images  is  less  than  perfect) . Each  pixel  in  the  original 


Figure  14-13.  Speckle-target  saturation  signal-to-noise  ratio  vs  turbulence  strength. [14] 
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Figure  14-14.  Simulated  glint  target  images  showing  original  image  and 
turbulence-degraded  images  for  five  values  of  the  log-amplitude  variance.  [14] 


image  is  then  multiplied  by  a random  number  where  x is  chosen  from  a Gaussian  distribution 

2 2 

with  mean  - CJ ^ and  variance  + CJ ^ . As  the  turbulence  increases,  the  target  remains  recognizable 

until  the  log-amplitude  variance  becomes  quite  large  (cr^>  0.1).  However,  by  the  time  the 

turbulent  scintillation  reaches  saturation,  the  image  has  virtually  disappeared  into  what  appears  to 
be  a field  of  stars.  It  should  be  noted  that  if  the  target  had  been  imaged  against  zero  background, 
the  target  would  have  remained  detectable.  Scintillation  is  unable  to  produce  bright  spots  if  the 
reflectivity  is  zero.  The  few  bright  spots  produced  by  the  target  would  easily  stand  out  against  the 
non-existent  background.  “Hot  spot”  detection  would  essentially  replace  image  detection  that 
obeyed  Johnson’s  criteria.  In  these  synthetic  images,  the  number  of  resolvable  spots  across  the 
typical  target  dimension  is  approximately  35.  The  corresponding  value  oiN/'N^a  ~24  is  so  large  that 
we  would  expect  to  detect  the  target  even  if  the  signal-to-noise  ratio  were  quite  small.  This  appears 
to  be  the  case.  Although  we  will  not  examine  the  detection  transition  in  detail.  However,  it  appears 
that  these  results  provide  additional  qualitative  support  for  the  information-based  perception  theory. 

Similar  images  can  be  generated  for  speckle  targets.  In  this  case  each  pixel  is  multiplied  by 
two  random  numbers,  one  selected  from  a unit-mean  exponential  distribution  (speckle)  and  one 
selected  from  a log-normal  distribution  (scintillation).  These  images  are  reproduced  in  Figure  14-15. 
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Figure  14-15.  Simulated  speckle  target  images  showing  the  original  image,  the  image  with 
speckle  only  and  turbulence-degraded  images  for  four  values  of  the  log-amplitude  variance.  [14] 


SPECKLE  ONLY  (0-2  = 0) 


0-2=001 


UNDEGRADED  IMAGE 


The  image  quality  remains  almost  constant  until  the  log-amplitude  variance  exceeds  0.1.  Then  it 
degrades  rapidly  until  the  image  is  unrecognizable  at  saturation  of  scintillation. 

Fluctuations  in  traditional  radar  detection  could  often  be  improved  by  pulse  integration.  The 
equivalent  process  in  imaging  is  frame-averaging  in  which  the  results  from  N separate  images  are 
added  on  a pixel-by-pixel  basis  and  then  normalized  by  the  number  of  frames  averaged.  That  is,  if 
4(i,j)  denotes  the  intensity  of  the  pixel  in  the  i*  row  and  the  j*  column  of  the  k*  image,  then  the  i,j* 
pixel  of  the  frame-averaged  image  is  given  by 

= (14.51) 

k=\ 


The  expression  for  the  signal- to-noise  ratio  of  a frame-averaged  image  is  [14] 


SNR{N) 


N-CNR/2 
N-CNR  1 

2 SNRs^^{N)  2 CNR 


(14.52) 
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where  the  frame-averaged  saturation  signal-to-noise  ratio  is  given  by 


SNR,^^(N)  = N ■ SNRs,r{l)  (14.53) 

Essentially  frame-averaging  improves  the  signal-to-noise  ratio  by  roughly  the  number  of  frames  that 
are  averaged. 

Although  it  would  take  an  inordinate  amount  of  frame-averaging  (thousands  of  frames 
averaged)  to  significantly  improve  images  obtained  at  saturated  scintillation,  moderate  amounts  of 
frame  averaging  (of  the  order  of  10  frames  averaged)  would  dramatically  improve  images  that  are 
only  moderately  degraded  by  scintillation.  Speckle-degraded  images  can  also  be  improved  by  frame 
averaging.  Figure  14-16  shows  a single- frame  of  laser  radar  data  clearly  exhibiting  speckle  and  an 
image  made  by  averaging  eight  frames  of  laser  radar  data.  The  speckle  effect  is  clearly  reduced 
producing  a much  more  recognizable  image.  Please  note  that  the  original  images  contained  only 
eight  bits  of  intensity  data.  Given  that  the  original  scene  had  components  spread  over  a dynamic 
range  of  at  least  14  bits,  the  dynamic  range  compression  coupled  with  nonlinearities  in  the 
reproduction  process  make  it  difficult  to  see  the  detail  present  in  the  darker  areas  of  the  original 
images. 


Figure  14-16.  Comparison  of  a single  frame  laser  radar  image  showing  the  degradation  due 
to  laser  speckle  (Rayleigh  fading)  and  the  improvement  obtained  by  averaging  8 frame  of  data. 


SINGLE  FRAME  EIGHT-FRAME  AVERAGING 
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Problems 


14-1.  Acoustic  daylight  is  a term  used  to  describe  imaging  in  which  a noise  source  (e.g.,  wave 
noise)  acts  as  a directional  form  of  illumination.  Reflections  of  this  illumination  from  targets 
can  be  detected  by  sonars.  What  sort  of  image  (physical  property  vs.  dimension  types)  might 
be  formed  by  an  acoustic  daylight  system? 

14-2.  An  electro-optical  imaging  sensor  has  1000  x 1000  detector  elements.  If  the  ground  sampled 
distance  is  1 meter,  how  big  is  the  field  of  view  of  the  sensor? 

14-3.  Image  plane  scanners  can  realistically  scan  through  angles  of  90°  x 90°.  If  the  sensor  with 
such  a scanner  must  cover  a 10°  x 10°  field  of  view,  what  is  the  maximum  allowable 
magnification  of  the  afocal  telescope  through  the  image  is  observed? 

14-4.  Name  six  levels  of  image-based  perceptual  performance.  Rank  these  levels  in  order  of 
difficulty  or  amount  of  target  information  provided. 

14-5.  What  are  the  current  numerical  values  of  Johnson's  criteria  for  detection,  recognition,  and 
identification? 

14-6.  Arrange  the  following  in  order  of  increasing  maximum  range  at  which  they  can  do  target 
recognition  (neglecting  atmospheric  effects): 

a.  A 10  pm  laser  radar  with  aim  diameter  aperture? 

b.  A 5 pm  thermal  imager  with  a 50  cm  aperture? 

c.  A 1 pm  television  with  a 20  cm  aperture? 

14-7.  A new  destroyer  is  to  be  fitted  with  a 3-5  pm  thermal  imager  for  target  identification.  If  the 
infrared  aperture  is  limited  to  15  cm  due  to  stealth  requirements,  what  recognition  and 
identification  ranges  can  be  expected  against  the  following  targets  using  the  newest  versions 
of  Johnson’s  criteria: 

a)  patrol  boat  (bow  aspect  — 8m  wide  x 8 m high 

b)  floating  mine  (0.8  m wide  x 0.4  m high) 

c)  small  boat  (bow  aspect  — 2.7  m wide  x 0.7  m high) 

d)  frigate  (bow  aspect  —14  m wide  x 10  m high) 

What  is  the  probability  of  identifying  any  of  the  targets  if  24  resels  can  be  obtained  across 
the  target? 

14-8.  If  only  aperture  size  is  a variable,  which  thermal  imaging  band  is  more  favorable  for  high 
resolution  imaging  (3-5  pm  or  8-12  pm)?  Why?  The  number  of  resolvable  cycles  across 
a target  is  18,  what  is  the  probability  of  identification?  What  is  the  probability  of  recogni- 
tion? 
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14-9.  An  observation  post  at  the  Korean  DMZ  requires  a TV  system  to  identify  targets  at  15  km 
range.  The  primary  criteria  are  detection  and  recognition.  Determine  the  minimum  size 
aperture  for  a 0.7-0. 9 pm  camera  if  the  nominal  target  size  is  5 meters.  How  many 
resolvable  cycles  across  the  target  are  required  to  detect  this  target  with  95%  probability. 
Is  this  detection  consistent  with  a camera  capable  of  recognition  at  50%  probability? 

14-10.  Which  is  more  likely  to  permit  recognition  at  a longer  range:  a visible  camera  with  a 25  mm 
aperture  lens  or  a 10  pm  thermal  imager  looking  through  a telescope  with  a 1 meter 
diameter?  Atmospheric  propagation  effects  are  assumed  to  be  negligible. 

14-11.  A thermal  imager  has  an  MRT  function  given  by:  MRT  (in  K)  = f (in  cy/mr) 

in  both  horizontal  and  vertical  directions.  Plot  the  probability  of  detection  and  probability 
of  recognition  vs  range  (every  1km  from  0km  to  10km)  for  a 3m  x 3m  target  with  a 3K  AT 
and  atmospheric  extinction  coefficient  of  0.22314km  '.  Use  the  latest  NVL  numbers  for 
Johnson's  criteria. 

14-12.  An  imager  for  use  on  a remotely-piloted  vehicle  (RP  V)  must  produce  IIRS  6 quality  imagery 
over  a down- looking  scan  swath  that  is  60°  wide.  If  the  RPV  flies  at  15  km  altitude,  what 
is  the  worst  case  angular  resolution  that  the  imager  can  tolerate?  If  the  imager  wavelength 
is  10  pm,  what  is  the  minimum  aperture  size  that  is  acceptable? 

14-13.  Assuming  an  intensity- imaging  laser  radar  can  perform  8-frame  averaging,  what  single  factor 
causes  the  recognition  performance  of  the  laser  radar  to  be  considerably  better  than  the 
performance  of  a thermal  imager  operating  at  the  same  wavelength  and  with  the  same 
aperture  size? 

14-14.  A spotlight  synthetic  aperture  radar  (SAR)  can  target  multiple  independent  images  of  a 
target.  The  information-theoretic  Johnson’s  criteria  say  that  if  SNR  can  be  improved,  then 
the  resolution  required  for  target  identification  can  be  relaxed.  If  16  SAR  images  can  be 
obtained  and  averaged,  what  range-cross  range  resolution  product  would  be  compatible  with 
recognition  of  a 3 x 7 meter  target? 

14-15.  A laser  radar  can  integrate  up  to  100  frames  of  target  data.  If  a saturation  SNR  of  4 is 
required  for  a recognition  function  to  be  performed,  what  turbulence  log-amplitude  variance 
can  be  tolerated? 
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CHAPTER  15 


TRACKING 


Tracking  Systems 

The  last  “fundamental”  sensor  funetion  is  tracking.  It  is  a higher  order  function  than 
detection  or  estimation,  involving  multiple  detection  and  estimation  processes.  However,  the 
function  is  so  important  and  involves  such  different  processes  that  it  deserves  separate  discussion. 
Tracking  is  performed  by  tracking  systems.  The  performance  of  such  systems  is  usually  differenti- 
ated in  three  distinct  way:  the  characteristics  of  the  signal  being  tracked,  the  size  of  the  target  being 
tracked  relative  to  the  tracking  window,  and  the  number  of  targets  being  tracked.  Signal  characteris- 
tics include  acoustic  vs.  electromagnetic,  active  vs.  passive,  coherent  vs.  non-coherent,  high 
frequency  vs.  low  frequency.  Targets  which  are  unresolved  by  the  tracking  sensor  are  called  point 
targets.  Point  targets  produce  signals  at  a single  possible  tracking  location.  Targets  which  are 
resolved  or  large  enough  to  produce  signals  over  a range  of  possible  track  positions  are  called 
extended  targets.  In  general,  tracking  systems  that  perform  well  against  point  targets  do  not 
perform  as  well  against  extended  targets,  while  tracking  systems  that  perform  well  against  extended 
targets  do  not  perform  well  against  point  targets,  although  there  are  a few  exceptions.  Many 
tracking  systems  work  well  against  a single  target  but  are  incapable  of  addressing  multiple  targets. 
Other  tracking  systems  can  handle  multiple  targets  as  easily  as  they  handle  single  targets. 

In  T able  1 5- 1 , a number  of  single- target  tracking  systems  are  described.  Note : some  of  these 
techniques  will  be  described  in  more  detail  in  later  chapters.  Here  we  desire  only  to  summarize  the 
different  major  approaches  to  tracking  and  to  elucidate  certain  key  underlying  principles.  Conical 
scan  trackers  are  commonly  used  in  radar  applications  although  the  technique  may  in  fact  be  utilized 
by  almost  any  signal  type.  [ 1 ] Conical  scan  trackers  scan  their  response  pattern  in  a tight  cone.  Any 
“target”  at  the  center  of  the  “con-scan”  produces  no  modulation  in  the  received  signal.  Targets  not 
at  the  center  produce  an  amplitude  modulation  that  is  proportional  to  the  distance  away  from  the 
center.  The  phase  of  the  modulation  can  be  used  to  establish  the  angular  (azimuthal)  position  of  the 
target.  Extended  targets  tend  to  smear  out  the  response  reducing  the  amplitude  modulation  and 
degrading  the  track.  Thus  con-scan  trackers  are  poor  extended  target  trackers.  Sequential  lobing 
is  a tracking  technique  employed  predominantly  in  the  radar  regime.[l]  It  can  be  considered  to  be 
a conical  scan  that  has  first  been  reduced  to  a one  dimensional  scan  and  then  reduced  to  a digital 
scan  between  angular  positions.  Any  target  not  located  at  the  midpoint  between  the  two  positions 
will  produce  a square-wave  amplitude  modulation.  As  in  con-scan  systems,  the  larger  the  angular 
misalignment,  the  large  the  amplitude  modulation.  Monopulse  tracking  systems  are  like  two 
(orthogonal)  sequential  lobing  systems  combined  and  looking  at  all  four  lobes  at  once.  [ 1 ] Either  the 
phase  or  the  amplitude  of  the  incoming  signal  may  be  measured  in  each  of  the  four  channels.  Error 
signals  are  generated  by  taking  sums  and  differences  of  the  four  lobes  (as  indicated).  Phase 
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Table  15-1.  Single  target  tracking  techniques. 
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monopulse  is  the  most  accurate  and  the  most  widespread  technique.  However,  most  visible  and 
infrared  sensors  cannot  implement  phase  monopulse  techniques.  They  can  implement  amplitude 
monopulse  in  the  form  of  a quadrant  detector  array. 

Rosette  scan  is  a pseudo-imaging  technique  (pseudo-imaging  in  the  sense  of  covering  a large 
angular  field  at  high  angular  resolution  - enough  to  create  an  image  - without  actually  forming  or 
using  an  image). [2], [3]  A single  detector  is  scanned  in  a rosette  (flower-like)  pattern.  Target 
location  is  determined  by  the  times  at  which  target  detections  are  made.  Rosette  scanning  is  used 
almost  exclusively  in  the  visible  and  infrared  regions.  A similar  time  based  approach  uses  four  long 
thin  detector  elements  arranged  in  an  array.  When  a scene  is  conically-scanned  around  the  four 
arrays,  the  timing  of  individual  detections  in  the  four  detections  can  yield  both  the  radial  and 
azimuthal  positions. 

Reticles  are  optical  elements  that  act  to  produce  light/dark  modulations.  [2]-[5]  Some  reticles 
are  transparent  with  the  light/dark  modulation  produced  by  alternating  opaque  absorbing  or 
reflecting  regions  with  transparent  regions.  Other  reticles  operate  in  reflection  with  the  light/dark 
modulation  being  produced  by  alternating  reflecting  regions  with  absorbing  or  transparent  regions. 
Reticle-based  scanning  can  take  several  different  forms.  Either  the  scene  can  be  conically  scanned 
around  a fixed  reticle  (con-scan  reticles)  or  the  scene  can  be  imaged  onto  a spinning  reticle  (spin- 
scan  reticles).  Depending  on  their  design,  either  AM  or  FM  modulation  can  be  produced  by  reticle 
systems.  Appropriate  demodulation  can  uniquely  determine  the  radial  and  azimuthal  position  of  a 
target. 


All  of  the  preceding  techniques  tend  to  work  better  against  point  targets  than  against 
extended  targets.  They  are  also  only  capable  of  tracking  a single  target  at  a time.  Imaging  trackers 
tend  to  work  better  against  extended  targets  than  against  point  targets,  although  they  can  track  point 
targets  fairly  easily.  Imaging  systems  are  also  conceptually  capable  of  tracking  multiple  targets 
simultaneously,  although  this  capability  must  be  explicitly  included  in  the  tracking  design. 


The  simplest  form  of  imaging  tracker  determines  the  centroid  of  the  target  and  tracks  the 
motion  of  the  centroid.  The  centroid  (/,  J)  may  be  determined  digitally  from  the  image  I{i,j)  by  the 
relations 


and 
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V'J=1,1  / 'J=M 


(15.1) 


(15.2) 


where  i and  j are  indices  denoting  specific  pixel  locations  in  the  image.  If  the  target  is  small 
compared  to  the  field  of  view,  it  is  desirable  to  gate  out  most  of  the  background  to  avoid  having  the 
background  bias  the  centroid.  Ideally,  the  image  processing  system  would  segment  the  image,  i.e., 
identify  which  pixels  are  background  and  which  pixels  are  target,  and  carry  out  the  integration  only 
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over  the  target  pixels.  An  adaptation  of  centroid  tracking  has  the  image  processing  system  generate 
a binary  image  (an  image  whose  intensity  at  any  pixel  is  either  0 or  1 - typically  determined  by 
comparing  each  pixel  intensity  against  a specified  threshold)  before  calculating  the  centroid.  Binary 
image  formation  could  be  performed  before  or  after  normal  segmentation.  Binary  image  formation 
can  act  as  a form  of  image  segmentation;  it  will  definitely  prevent  bright  areas  on  the  target  from 
biasing  the  centroid.  An  analogue  form  of  centroid  tracking  can  be  accomplished  by  using  gated 
integrators  on  the  video  signal.  In  a gated  video  tracker,  four  integrators  integrate  different  parts 
of  the  video  signal.  Timing  of  the  integrator  gates  is  such  that  the  four  integrators  each  integrate  on 
quadrant  of  a rectangular  area  containing  the  target.  The  four  integrators  act  just  like  a quadrant 
detector  (amplitude  monopulse)  system.  The  centroid  can  be  determined  from  the  sums  and 
differences  of  the  integrated  signals  recorded  by  the  four  integrators. 

Another  image  tracker  has  the  image  processing  system  segment  the  image  and  then  apply 
an  edge  detector  to  the  segmented  target.  The  centroid  of  the  edges  is  then  tracked.  Proponents  of 
edge  trackers  claim  superior  performance.  The  author  has  not  seen  convincing  justification  for 
these  claims. 

A different  image  tracking  technique  is  correlation  tracking.  In  a correlation  tracker,  the 
image  I{i,  j)  is  cross  correlated  with  a reference  image  R{i,  j)  producing  a correlation  function 

N,M 

C(i'J')=  (15.3) 


The  coordinates  (/  ( j ’)  at  which  the  cross  correlation  function  is  a maximum  describes  the  shift  (in 
pixels)  between  the  image  and  the  reference.  If  the  assumption  is  made  that  any  shift  is  due  to  target 
motion,  then  this  shift  is  also  the  number  of  pixels  that  the  imager  must  be  “moved”  to  reposition 
the  target  at  the  “reference  point”.  An  a priori  stored  image  could  be  used  as  the  reference. 
However,  in  most  systems,  the  reference  image  is  the  immediately  preceding  frame  of  image  data. 
This  has  the  benefit  of  allowing  slow  variations  in  target  scale  and  orientation  to  be  accommodated. 
The  frame-to-  frame  changes  will  be  small  enough  that  the  peak  of  the  correlation  function  is  only 
minimally  affected. 

Feature-based  trackers  assume  the  existence  of  an  automatic  recognizer.  For  the  most  part 
they  use  one  of  the  image-based  tracking  schemes  described  above.  However,  they  also  tag  each 
track  file  with  key  identifying  features  and  compare  the  features  in  each  new  image  frame  with  those 
of  the  prior  frames.  This  permits  a target  to  be  tracked  even  if  it  passes  behind  an  obscuration.  The 
loss  of  the  key  features  indicates  the  target  has  disappeared  and  the  tracker  should  go  into  a “coast 
mode”  in  which  it  follows  the  last  valid  tracking  commands  and  gradually  loosens  up  any  tracking 
gates.  If  the  target  reappears  from  behind  the  obscuration,  the  reappearance  of  the  identifying 
features,  guarantees  that  the  proper  target  is  being  tracked  and  a new  track  file  is  unnecessary. 

If  properly  designed  image-based  trackers  can  handle  tracking  of  multiple  targets.  This  is 
not  true  of  the  other  tracking  techniques  in  Table  15-1.  They  can  only  handle  multiple  targets  by 
fielding  multiple  tracking  systems  (one  per  target  to  be  tracked).  In  Table  15-2  the  techniques 
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amenable  to  multiple  target  tracking  are  described.  As  with  single  target  trackers,  some  multiple- 
target  tracking  techniques  work  better  against  point  targets;  others  work  better  against  extended 
targets. 


The  dominant  multiple-target  tracking  technique  is  track-while-scan  (TWS).  The  name 
accurately  describes  the  way  it  works.  The  sensor  scans  its  field  of  view  continuously.  No  special 
attention  or  action  is  required  to  permit  tracking.  After  a scan  of  the  complete  field  of  a set  of 
“detections”  or  threshold  crossings  has  been  established.  On  the  next  scan,  each  detection  is 
associated  with  a detection  in  the  set  from  the  preceding  scan.  If  an  old  detection  cannot  be 
associated  with  a current  detection,  it  becomes  a candidate  to  be  dropped  as  a current  track  file.  If 
a new  detection  cannot  be  associated  with  and  old  detection,  it  becomes  a candidate  for  establish- 
ment of  a new  track  file.  After  several  scans  and  associations,  enough  data  exists  to  initialize  a 
tracking  filter  for  each  set  of  scan-to-scan  persistent  detections.  On  successive  scans,  detections 
are  associated  with  track  files  (assisted  by  predicted  positions  from  each  tracking  filter)  and  used 
to  update  the  tracking  filters,  or  they  are  tagged  to  potentially  establish  new  track  files. 

Track- while- scan  can  be  implemented  in  a number  of  forms.  Azimuth-only  TWS  can  be 
generated  by  radio-frequency  interferometers  (direction  finders),  passive  sonars,  and  radars  that  are 
being  affected  by  range-denial  jammers.  Most  surface  search  radars  and  some  air  search  radars  can 
be  used  to  perform  azimuth-plus-range  TWS.  Infrared  search  and  track  systems  without  associated 
rangefinders  perform  azimuth-plus-elevation  TWS.  Some  sophisticated  air  defense  radars,  aircraft 
fire  control  radars,  laser  radars,  and  infrared  search  and  track  systems  with  associated  rangefinders 
there  is  the  ability  to  perform  three-dimensional  TWS  (range-plus-azimuth-plus-elevation). 

Systems  with  phased  array  antennas  have  the  potential  to  do  multiple  target  tracking  using 
interferometric  beam  forming.  Using  either  the  whole  array  or  a small  subset  of  the  whole  array 
an  antenna  lobe  can  be  constructed  in  any  desired  direction  by  properly  shifting  the  phases  of  the 
signals  received  by  the  array  elements  involved.  An  object  in  such  an  antenna  lobe  can  be  tracked 
by  the  equivalent  of  a monopulse  technique  or  by  a scanning  technique  analogous  to  sequential 
lobing  or  conical  scanning  by  suitable  modulation  of  the  phases  and  observing  the  modulations  they 
produce  in  the  signal.  As  target  motions  are  detected,  the  element  phases  may  be  varies  to  keep  the 
target  in  the  lobe.  As  more  targets  need  to  be  tracked,  more  antenna  lobes  can  be  synthesized.  This 
is  easier  to  say  than  it  is  to  do.  If  the  entire  array  is  used  to  synthesize  the  tracking  lobes,  then  gain 
will  be  reduced  as  the  number  of  lobes  increases.  If  a subset  is  used  for  each  lobe,  then  each  lobe 
will  suffer  a gain  reduction  proportional  to  the  fraction  of  the  array  elements  used. 

Most  image-based  tracking  techniques  can  be  used  to  track  multiple  targets  just  as  they  can 
track  single  targets.  Point  targets  can  be  detected  in  images  and  tracked  using  tracking  filters  just 
as  in  an  azimuth-plus-elevation  TWS  system.  The  limitation  on  the  number  of  extended  targets  that 
can  be  tracked  by  other  imaging  techniques  depends  on  the  amount  of  image  processing  power  that 
can  be  brought  to  bear.  Calculation  of  a correlation  function  is  computationally  intensive.  The 
degree  of  computation  difficulty  will  scale  with  the  number  of  targets  that  must  be  processed. 
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Table  15-2.  Multiple  target  tracking  techniques 
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Tracking  Filters 


Tracking  filters  are  mathematical  post-processing  that  is  performed  to  improve  a tracking 

systems  ability  to  estimate  the  state  vector  of  targets.  The  state  vector  X is  a quantity  describing 
the  dynamics  of  the  target  at  the  level  of  concern  to  the  problem  at  hand,  i.e., 

X = {X„X,,-,X„]  (15.4) 

where  the  X elements  of  dynamic  motion.  The  simplest  form  of  state  vector  might  be  the 
location  of  the  target  and  its  velocity.  If  the  sensor  were  only  capable  of  measuring  angle  to  the 
target,  the  state  vector  might  consist  of  the  target  angle  and  the  angular  velocity, 

X = {6>,  (9}  (15.5) 

where  the  dot  denotes  a derivative  with  respect  to  time.  If  acceleration  were  deemed  important  to 
the  problem,  the  state  vector  would  consist  of  the  target  angle,  the  angular  velocity,  and  the  angular 
acceleration 


X — 1^,  0,  . (15.6) 

If  the  system  were  able  to  measure  range  and  angle,  then  the  state  vector  might  be 

X = {T?,  0,  (j),  R,  0,  (f),  R,  0,  . (15.7) 

The  state  vector  for  a target  maneuvering  relative  to  a sensor  on  a maneuvering  platform  might  get 
very  complicated. 

Track-while-scan  systems  suffer  from  three  fundamental  difficulties:  track  precision 

limitation,  track  loss  due  to  missed  detections  or  false  alarms,  and  difficulty  in  frame-to-frame  target 
association.  In  any  real  sensor  system  there  will  be  errors  in  measuring  the  state  vector.  Without 
a track  filter,  the  track  error  can  be  no  smaller  than  the  measurement  error  (and  this  is  often  not  good 
enough).  Note  that  it  is  not  really  possible  to  completely  eliminate  a tracking  filter.  If  a human 
being  looks  at  the  data,  even  if  only  to  connect  the  dots  on  a plot,  that  person  will  act  as  a tracking 
filter.  However,  a computer-based  filter  can  usually  improve  performance  more  than  the  human. 

Track- while- scan  systems  rely  on  the  ability  to  detect  an  object  time  and  again  and  associate 
a detection  in  one  data  interval  (e.g.,  an  image  frame  which  equals  one  scan  of  the  field  of  view) 
with  a detection  in  another  interval.  If  the  object  is  not  detected  in  a frame,  what  decision  is  to  be 
made?  Did  the  target  exit  the  scene?  Did  it  crash  (not  an  impossible  outcome)?  Did  the  sensor 
simply  fail  to  detect  the  target?  What  if  a false  alarm  (remember  that  the  sensor  cannot  tell  a false 
alarm  from  a real  target)  occurs  in  a frame?  Is  this  a new  target?  Which  of  the  two  detections  is  the 
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target?  Without  a tracking  filter,  it  is  difficult  to  make  such  decisions.  With  a filter,  they  may  not 
need  to  be  made  at  all.  The  problem  lies  in  the  need  to  associate  detections  between  frames. 


Consider  the  sequence  of  detections  shown  in  Figure  15-1.  Two  targets  are  converging. 
They  appear  to  be  following  slightly  curve  trajectories,  but  because  of  sensor  noise  it  is  not  possible 
to  rule  out  straight  line  motion.  Which  of  the  two  possible  sets  of  associations  is  correct?  A 
tracking  filter  might  shed  light  on  this  question  by  (1)  providing  the  best  estimate  available  of  target 
position  when  the  critical  association  decision  needs  to  be  made  (at  frame  5),  and  (2)  by  having 
estimates  for  higher  order  terms  in  the  dynamics  of  the  target  which  would  shed  light  on  whether 
or  not  a maneuver  appeared  to  be  underway.  This  is  not  to  say  that  a tracking  filter  cannot  make  a 
mistake  in  such  “pathological”  situations,  but  that  is  will  likely  do  better  than  the  human  processor 
would  do  (barring  luck  or  good  intuition). 


Figure  15-1.  A confusing  sequence  of  detections  of  two  targets.  Did  the  target 
trajectories  cross  or  did  they  swerve  to  avoid  a potential  collision? 

Which  target  is  which  at  the  end  of  this  sequence? 
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Consider  the  plot  shown  in  Figure  15-2.  This  plot  contains  9 frames  of  track  data  with  10 
real  targets.  The  10  real  targets  are  detected  with  a detection  probability  of  95%.  Thus  there  are 
roughly  5 missed  detections.  The  plot  also  has  an  average  of  1 false  alarm  per  frame.  The  reader 
is  invited  to  attempt  his  own  fit  to  the  points.  The  actual  tracks  are  overlaid  on  this  plot  in  Figure 
15-3.  It  is  obvious  that  in  more  than  one  instance,  it  would  be  difficult  to  make  an  unequivocal 
association  of  detections  to  a track.  For  example,  in  the  upper  right  hand  comer,  just  as  two  tracks 
cross,  there  is  a missed  detection.  Which  track  actually  had  the  missed  detection?  In  the  center  of 
the  plot  there  are  several  track  crossings  that  are  actually  close  encounters.  Again  the  question  arises 
of  which  detection  belongs  to  which  track.  Although  a mistaken  association  might  not  cause  a loss 
of  track  in  this  example,  it  will  increase  the  error  in  predicting  the  track.  Now  consider  the  problem 
facing  an  air  controller  which  may  have  several  hundred  targets  being  tracked  simultaneously.  It 
is  problems  like  this  that  have  made  tracking  filters  a necessity. 

Consider  the  generic  tracking  filter  described  in  Figure  15-4. [6]  The  actual  target 

dynamics  will  produce  a state  vector  X . The  observation  vector  Y consists  of  those  components 
of  the  state  vector  that  the  sensor  measures  plus  any  noise 

Y = UX  + N (15-8) 


Figure  15-2.  A sequence  of  9 frames  of  track  data  with  10  real  targets, 

5 missed  detections  (Pp  = 0.95),  and  an  average  of  1 false  alarm  per  frame. 
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Figure  15-3.  The  plot  of  Figure  15-2  overlaid  with  the  actual  tracks. 


Figure  15-4.  Functional  flow  diagram  of  a generic  tracking  fdter. 
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where  H is  the  transfer  function  of  the  sensor  and  N is  the  sensor  noise  vector.  The  measurement 
transfer  function  is  the  mathematical  description  of  the  manor  in  which  the  sensor  translates  actual 
state  vector  information  into  measurement  information.  For  example,  a pure  Doppler  radar  would 
transform  a state  vector  by  ignoring  position  and  acceleration  terms,  and  then  projecting  the  velocity 
terms  onto  the  vector  describing  the  line  of  sight  to  the  target.  That  is,  it  converts  the  three- 
dimensional  velocity  into  a radial  velocity. 

A residual  Y is  the  difference  between  a predicted  observation  HX p and  the  actual 

observation.  The  residual  is  usually  checked  for  consistency;  it  should  be  small.  If  the  residual  is 
so  large  that  it  cannot  reasonably  be  due  to  valid  target  motion  (e.g.,  surface  targets  seldom  travel 
at  Mach  1 and  aircraft  do  not  normally  travel  at  Mach  10),  then  the  current  observation  may  be  due 
to  a false  alarm  or  a new  target.  Such  inconsistent  observations  should  not  be  included  in  the  track 
file.  The  residual  might  also  be  used  to  adjust  filter  gains  in  an  adaptive  fashion.  The  predicted  and 
actual  observations  and  the  residual  are  inputs  to  the  filtering  processor.  In  a filter,  a fraction  of  the 

residual  is  added  to  the  predicted  observation  to  establish  a smoothed  estimate  A: | A:)  = X^of 

the  current  state  vector.  The  fraction  of  the  residual  added  to  the  predicted  observation  is 
determined  by  the  gain  coefficient.  Addition  of  less  than  the  complete  residual  means  that  only  a 
portion  of  any  measurement  error  will  be  included  in  the  new  estimate.  Over  time,  the  increased 
reliance  on  prediction  versus  actual  measurement  will  lead  to  a reduction  in  the  variance  of  the 
estimated  state  vector.  This  reduction  in  variance  of  the  estimated  position  versus  the  variance  in 
the  observed  position  is  called  smoothing.  The  smoothed  estimate  of  the  state  vector  is  combined 
with  the  expected  target  dynamics  (all  filters  must  assume  a fixed  type  of  target  motion  such  as 
uniformly  accelerated  linear  motion,  constant  velocity  circular  motion,  etc.)  to  produce  a predicted 

state  vector  X p for  the  next  measurement  time.  The  predicted  state  vector  is  transformed  using  the 

transfer  function  to  establish  the  new  predicted  observation.  After  a new  measurement  has  been 
made,  the  cycle  is  ready  to  repeat. 

There  are  two  distinct  forms  that  tracking  filters  can  take:  fixed-coefficient  or  adaptive 
coefficient  filters.  In  a fixed-coefficient  filter,  the  gains  (coefficients  in  the  smoothing  algorithm) 
are  fixed  in  advance.  In  an  adaptive  coefficient  filter  the  gains  are  gradually  varied  up  and  down 
depending  on  the  residuals.  When  the  residuals  are  small,  the  gain  coefficients  are  reduced  until  the 
residuals  start  to  increase.  If  the  residuals  are  large,  the  gain  coefficients  are  increased  until  the 
residuals  stop  decreasing  after  a coefficient  increase.  The  Kalman  filter  is  the  most  common 
implementation  of  an  adaptive  coefficient  algorithm. 

As  indicated  above,  all  tracking  filters  assume  a set  of  target  dynamics.  The  filters  use  the 
dynamical  equations  to  predict  ahead  in  time  to  estimate  what  the  new  state  vector  should  be  based 
on  the  old  estimate.  If  the  target  motion  does  not  match  the  dynamics  for  any  extended  period  of 
time,  the  prediction  will  get  worse  and  worse  and  the  residuals  will  get  larger  and  larger.  For 
example,  a common  assumption  is  straight-line,  uniformly  accelerated  motion.  The  dynamical 
equations  are 
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X(t)  = 0.5A(oy  + v(0)t  + X(0) 

(15.9) 

V(t)^  A(0)t  + V(0) 

(15.10) 

II 

(15.11) 

or  in  terms  of  a dynamics  matrix  D and  state  vector  X — ^X,V,  A j 


O 

to 

1 

X(0) 

1(?)  = D-X(0)  = 

0 1 t 

r{o) 

0 0 1 

A{0) 

(15.12) 


A ballistic  missile  would  be  assumed  to  follow  a ballistic  trajectory  after  motor  burnout.  A non- 
maneuvering satellite  would  be  assumed  to  follow  a trajectory  consistent  with  orbital  dynamics.  A 
target  aircraft  performing  a breakaway  maneuver  might  be  expected  to  follow  a uniform  centripe- 
tally  accelerated  trajectory  for  a short  period  of  time.  If  the  precise  dynamics  cannot  be  determined 
then  it  often  becomes  necessary  to  simultaneously  run  parallel  tracking  filters  with  different 
estimated  target  dynamics  and  select  the  output  at  each  sample  that  gives  the  smallest  residual. 
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Fixed-Coefficient  Filters 


The  simplest  tracking  filters  to  understand  are  the  fixed-coefficient  filters.  The  simplest 
fixed-coefficient  filters  are  the  a- (5 and  a-/?- /filters.  Before  describing  these  filters  in  detail,  we 
must  define  a number  of  terms: 

X {k\j^  — Estimate  of  target  state  vector  at  sample  k given  samples  through  j 

Xs  Smoothed  estimate  of  state  vector  at  sample  k 

Xq  {k^—  Observed  value  of  state  vector  at  sample  k 
X p{k  + l)  = Predicted  value  of  state  vector  at  sample  k+1 
qT  — Time  since  last  valid  sample  {T=  sample  time  interval) 

a,  P,y  = Fixed  (but  selectable)  filter  coefficients 
We  will  use  these  definitions  throughout  this  section. 

Now  let  us  begin  by  describing  the  a-P  tracking  filter.  This  filter  assumes  simple 
unaccelerated  motion  (in  a straight  line).  The  dynamical  equations  are 

X(f)  = F(0)f  + X(0)  (15.13) 

and 

V(t)^V(0).  (15.14) 


Measurements  are  made  only  of  the  X component  of  the  state  vector  X = |X,  V } . The  filter 
algorithm  goes  as  follows  [6]: 

The  smoothed  estimate  of  the  position  Xis  developed  by  adding  a fraction  a of  the  residual 
{Xq-Xp)  to  the  original  prediction 

X^iji)  = X{k\k)  = X piji)  + - Xp(A:)] . (15.15) 

A smoothed  estimate  of  the  velocity  V is  obtained  by  developing  a velocity  “residual” 
(position  difference  divided  by  time  increment)  and  adding  a fraction  p of  that  velocity 
residual  to  the  prior  smoothed  estimate  of  velocity 
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(15.16) 


V,(k)=  k{k\k)  = V,(k-\)  + !{x„{k)- X ,{k)]l  qT . 

The  predicted  position  of  the  next  observation  is  determined  by  using  the  dynamical 
equations  and  inserting  the  smoothed  estimates  of  the  parameters 


Xp{k  + 1)  = X(k  + 1|  A:)  = Xs(k)  + TVs{k) . (15.17) 

This  prediction  is  combined  with  the  next  observation  to  determine  a residual  that  begins  the 
process  again.  This  loop  is  repeated  over  and  again  until  tracking  is  no  longer  required. 
This  iterative  algorithm  needs  initiation.  To  initiate  the  loop,  one  assumes  the  following 
initial  conditions.  The  initial  smoothed  position  estimate  is  assumed  equal  to  the  first 
prediction  and  both  are  set  equal  to  the  first  measurement 

Vj(0)  = V,.(l)  = V„(0).  (15.18) 


The  initial  smoothed  velocity  is  assumed  to  be  zero,  with  the  subsequent  smoothed  estimate 
being  calculated  from  the  finite  difference  approximation  to  the  first  derivative  using  the  two 
position  observations  as  inputs. 


and 


F,(0)  = 0 


dX^ 


X(l)-^o(0)]/7’. 


(15.19) 

(15.20) 


Once  initialized  the  a-/? tracking  algorithm  can  be  cycled  from  observation  A:  to  observation  k+\  as 
long  as  desired. 

The  a-J3-y  tracking  filter  assumes  uniformly  accelerated  straight  line  motion.  The 
dynamical  equations  are  Eq.(15.9)  through  (15.11).  Again  measurements  are  made  only  of  the 

position  component  of  the  state  vector  X — {X,  F,  Aj . The  filter  algorithm  goes  as  follows  [6]: 

The  smoothed  estimate  of  the  position  Xis  developed  by  adding  a fraction  a of  the  residual 
(Xq-Xp)  to  the  original  prediction 


Xs{k)  = Xp{k)  + a[Xo{k) - Xp{k)' . 


(15.21) 


A smoothed  estimate  of  the  velocity  V is  obtained  by  developing  a velocity  “residual” 
(position  difference  divided  by  time  increment)  and  adding  a fraction  p of  that  velocity 
residual  to  the  sum  of  the  prior  smoothed  estimate  of  velocity  and  the  estimated  velocity 
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change  due  to  acceleration 


V,(k)  = V,{k-l)  + qTA,{k-l)  + l\X„(k)- X,{k)\l  qT . (15.22) 


A smoothed  estimate  of  the  acceleration  A is  obtained  by  developing  an  acceleration 
“residual”  (based  on  the  velocity  residual  divided  by  the  time  increment)  and  adding  a 
fraction  y of  that  acceleration  residual  to  the  prior  smoothed  estimate  of  the  acceleration. 

A,{k)^A,{k-\)  + r[Xo{k)-X,{k)]l{qT)-.  (15.23) 

The  predicted  position  of  the  next  observation  is  determined  by  using  the  dynamical 
equations  and  inserting  the  smoothed  estimates  of  the  parameters 


Xp(k  + 1)  = X^(A:)  + TVs(k)  + 0.5  As{k) . (15.24) 

This  prediction  is  combined  with  the  next  observation  to  determine  a residual  that  begins  the 
process  again.  This  loop  is  repeated  over  and  again  until  tracking  is  no  longer  required. 
This  iterative  algorithm  needs  initiation.  To  initiate  the  loop,  one  assumes  the  following 
initial  conditions.  The  initial  smoothed  position  estimate  is  assumed  equal  to  the  first 
prediction  and  both  are  set  equal  to  the  first  measurement 

.r,(o)  = .r,(i)  = Xo(o).  (15.25) 

The  initial  smoothed  velocity  is  assumed  to  be  zero,  with  the  subsequent  smoothed  estimate 
being  calculated  from  the  first  two  position  observations, 

F,(0)  = 0 (15.26) 

and 

F,(l)  = [X„(l)-X„(0)]/7’.  (15.27) 

The  first  two  initial  smoothed  accelerations  are  assumed  to  be  zero,  with  the  subsequent 
smoothed  estimate  being  calculated  from  the  finite  difference  equation  for  the  second 
derivative  using  the  first  three  position  observations, 

24,(0)=  4(1)  = 0 (15.28) 


and 
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^^(2) 


d^Xo 

1 

dXo 

dt^ 

(2) 

dt 

(2) 

dt 

0)_ 

l[X„{2)  - A'„(l)]  / T)  - ([X„(l)  - X„(0)]  / t) 


IT.  (15.29) 


= [X„(2)  + Xo(0)-2X„(l)]/r^ 

Once  initialized  the  a-yS-;' tracking  algorithm  can  be  cycled  from  observation  k to  observation  k+\ 
as  long  as  desired. 

How  good  is  the  performance  of  these  simple  filters?  It  can  be  shown  that  the  optimum 
choice  of  p given  an  arbitrary  choice  of  a is 

y?=2(2-ct)-4Vl-«  (15.30) 

for  either  the  a-fi  or  the  a-/?-y  filters. [4]  Given  both  a and  /?  the  optimum  choice  of  y in  an  ctr-y5-y 
filter  is 

y - /3^ ! 2a  . (15.31) 

The  optimum  /?  and  ycoefficients  are  shown  as  functions  of  the  a coefficient  in  Figure  15-5. 

The  smoothing  efficiency  for  a component  of  the  state  vector  is  defined  as  the  variance  of 
the  smoothed  estimates  of  that  component  divided  by  the  variance  of  the  observations  of  that 
component.  In  the  absence  of  direct  observations  of  a component,  then  the  finite  difference 
approximation  to  the  derivative  defining  that  component  is  substituted  along  with  the  observations 
to  determine  the  smoothing  efficiencies.  For  the  a-fTy  filter  the  position  smoothing  efficiency 
(variance  reduction)  is 


^2a^+/?(2-3a) 

a[A-p-2a] 

and  the  velocity  smoothing  efficiency  (variance  reduction)  is 
K 

" T^a[A-p-2a\ 


(15.32) 


(15.33) 
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Figure  15-5.  Optimum  choices  of  beta  and  gamma  coefficients 
given  an  arbitrary  selection  of  the  alpha  coefficient. 
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These  two  smoothing  efficiencies  are  plotted  as  functions  of  the  yet  to  be  selected  a 
coefficient  in  Figure  1 5-6.  As  a decreases  the  position  smoothing  efficiency  becomes  smaller  and 
smaller  (meaning  the  variance  is  reduced  more  and  more).  It  is  a quite  respectable  1 0%  at  a = 0. 1 . 
The  velocity  smoothing  falls  even  faster  than  the  position  smoothing. 

This  would  suggest  that  the  optimum  performance  would  be  obtained  by  making  a as  small 
as  possible.  Unfortunately,  with  the  good  (smoothing)  there  comes  the  bad.  In  a tracking  filter,  the 
“bad”  takes  the  form  of  response  time  and  hangoff  error.  The  gain  coefficient  a is  the  fraction  of 
the  residual  that  is  added  to  the  prediction  to  make  the  smoothed  estimate.  This  means  that  any 
observation  takes  1/ a measurement  intervals  before  its  effects  become  negligible.  An  alternative 
view  would  be  that  it  takes  1 /or  measurement  intervals  before  any  change  becomes  apparent.  There 
is  a non-zero  response  time  tg  associated  with  a tracking  filter  that  is  roughly  given  by 

t^  = T/a.  (15.34) 
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Figure  15-6.  Degree  of  smoothing  (variance  reduction)  achievable  in  an  a-P  tracking  filter. 
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Filters  with  good  smoothing  characteristics  also  tend  to  have  long  response  times.  Response  time 
can  become  critical  if  the  target  has  any  ability  to  maneuver  or  change  a parameter  of  its  dynamics. 
If  the  target  motion  changes,  the  filter  will  make  no  appreciable  change  in  its  output  or  predictions 
until  a response  time  has  elapsed.  If  the  motion  change  is  too  great,  a slow  response  time  can  permit 
the  residual  to  grow  to  a value  deemed  unphysical  and  the  track  will  be  lost. 

Hangoff  is  another  potential  problem.  If  the  filter  designer  has  mis-estimated  the  dynamics 
of  the  target,  then  hangoff  may  occur.  Consider  the  a-fi  filter.  If  the  target  actually  has  a constant 
non-zero  acceleration  in  motion,  then  the  filter  will  never  accurately  predict  the  future  position.  The 
acceleration  will  cause  the  target  to  stay  one  jump  ahead  of  the  tracker.  This  is  hangoff.  The 

hangoff  error  in  an  a-fi  tracker  for  a constant  acceleration  X is  given  by 


p 


(15.35) 
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Hangoff  error  and  response  time  for  an  a-P tracker  are  shown  in  Figure  15-7.  Both  get  worse  as  a 
gets  smaller.  The  choice  of  a must  result  from  a tradeoff  between  the  degree  of  smoothing  desired, 
the  acceptable  response  time,  and  the  magnitude  of  any  potential  hangoff  error  that  may  be  present. 
In  many  cases  a choice  of  0.1  < or  < 0.2  is  a reasonable  compromise. 


The  optimum  coefficients  for  the  a- P-y  filter  are  the  same  as  those  for  the  a-P  filter  and  have 
already  been  given  in  Figure  15-5.  The  smoothing  efficiencies  for  this  filter  can  be  determined  from 


1^0^  ^p{pi-^a^-ra\A-p-la\ 
p-la\lap+aY-lY\ 

-I- 2/^  [2  - O'] 

^ T^\\- P-l(x\laP-\- aY  ~'1y\ 


(15.36) 


(15.37) 


Figure  15-7.  Hangoff  errors  and  reaction  time  limitations  associated 
with  fixed-coefficient  tracking  filters. 
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and 


(15.38) 


^ ^ MV 

^ r'^[4-y^-2ci'][2<2y^+ <27 -2/] 

These  efficiencies  are  plotted  in  Figure  15-8. 

As  with  the  a-f5  tracker  the  a-/?-/ tracker  also  suffers  from  response  time  limitations  and 
hangoff  errors.  The  response  time  for  the  two  trackers  is  identical  and  given  by  Eq.  (15.3 1).  The 

hangoff  error  for  an  tracker  following  a target  with  a constant  jerk,  X , is  given  by 


As  before,  a must  be  determined  through  a trade  between  smoothing  efficiency,  hangoff,  and 
response  time. 


Figure  15-8.  Degree  of  smoothing  (variance  reduction)  achievable  in  an  a-y5-ytracking  filter. 
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Kalman  Filters 


A Kalman  filter  is  a tracking  filter  whose  coefficients  are  allowed  to  vary  based  on  the 
measured  residuals.  When  residuals  are  low,  the  “gains”  are  increased  to  provide  improved 
smoothing.  When  residuals  become  large,  the  gains  are  reduced  significantly  to  improve  the  filter’s 
ability  to  respond  rapidly  to  the  changes.  If  properly  implemented,  Kalman  filters  will  provide 
optimum  smoothing  for  a given  level  of  sensor  noise  and  disturbance  inputs. 

In  a Kalman  filter,  the  target  motion  is  assumed  to  be  a discrete  Markov  process.  By  this  it 
is  assumed  that  knowledge  of  the  state  vector  of  a system  at  an  instant  of  time  is  adequate  to  describe 
the  future  behavior  of  that  system.  Specifically,  the  state  vector  evolves  according  the  equation  [6] 

X(k  + 1)  = GX{k)  + Q(k)  + F{k  + 1|  k)  (15.40) 

where 

~X(k) 

X(k)=  V(k)  (15.41) 

_A(k)_ 

is  the  state  vector  at  measurement  point  k.  Note  we  have  assumed  a specific  form  for  the  state 
vector.  The  form  of  the  state  vector  for  a real  problem  will  be  dictated  by  the  nature  of  that  problem. 
G is  a presumed  known  transition  matrix  describing  the  gross  dynamics  of  the  target.  For  example, 
if  we  assume  that  the  dynamics  are  uniformly  accelerated  linear  motion,  then 

"l  T T^!2 

G=  0 1 T (15.42) 

0 0 1 

is  a vector  (with  covariance  Q - see  below  for  a definition  and  Appendix  C for  a more  detailed 

discussion  of  the  covariance  of  a random  vector)  describing  noise  in  the  target  motion.  Often  it  is 
assumed  to  be  zero-mean,  white,  Gaussian  noise.  A common  source  of  this  noise  is  turbulence 

buffeting.  The  vector  function  F is  a deterministic  input  accounting  for  motion  of  the  tracking 
platform. 

The  variance  of  a function  is 

Var(X)  = e[[X  - E(X))[X  - ^(X))} . (15.43) 

By  analogy  the  covariance  of  two  variables  X and  Y is 
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Cov(X,  ¥)  = e{{X-  E(X)){¥- £■(!'))} . 


(15.44) 


The  covariance  of  a vector 


is  defined  by 


x = {x„x„-,x,} 

¥„=Coy[x„X,) 


(15.45) 

(15.46) 


Var(A',)  Cov(V,,X,) 
Cov(V,,V,)  Var(V,) 

Cov(V„V„)  Cov(A'„V„) 


Cov(V„V„)' 

Cov(V„V„) 

Var(Xj 


(15.47) 


Sensors  measure  components  of  the  state  vector.  They  may  or  may  not  measure  the 
components  directly.  The  measurement  process  is  modeled  by  the  equation 


Y(k)  = HX(k)+N(k). 


(15.48) 


His  a transfer  function  describing  the  measurement  process.  A^(  A:)  is  a zero-mean,  white,  Gaussian 
measurement  noise  vector  with  covariance  R^. 

The  Kalman  filter  algorithm  has  two  parts:  a set  of  initial  values  and  a sequence  of  recursion 
relations. [6]  The  initial  values  are: 

observation  number: 


k = 0,  (15.49) 

estimated  error  (zero-mean  Gaussian): 

P(0 1 0)  = £{[V(0)  - £{X(0)}][V(0)  - £{X(0)}]"} 

= var{x(0)} 


where  the  superscript  T denotes  the  transpose  of  a matrix,  and  estimated  state  vector  at  observation 
0 given  no  measurements: 
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x{0 1 0)  = e{1(0)}  = {E(;i',(o)),  E(;i',(o)),  e(^„(o))}  . (is.si) 


The  covariances  Q and  are  constants. 

The  recursion  relations  in  order  of  execution  are: 
estimated  error  given  k observations: 

P{k  + \\k)^GP{k\k)G^  + Q 

Kalman  filter  gain: 

Kik^G  P{k  + ^\k)H^ 

^ ’ HP{k  + \\k)H^ + Rc 


(15.52) 


(15.53) 


estimated  error  given  k+1  observations: 

P(k  + \\k  + l)^[l-K(k  + \)H]P(k  + \\k)  (15.54) 

estimated  state  vector  given  k observations: 

X(k  + \\k)^  GX(k\k)  + F(k  + \\k)  (15.55) 

estimated  state  vector  given  k+1  observations: 

X(k  + 1| A:  + 1)  = X(k  + l\k)  + K(k  + l)[f (A:  + 1) - HX(k  + 1| A:)]  (15.56) 
incremented  observation  number: 


A:  ^ A:  + 1 


(15.57) 


At  the  conclusion  of  Eq.  (15.57),  the  recursion  loops  back  to  Eq.  (15.52)  for  the  next  measurement. 
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Comparison  of  Fixed-Coefficient  and  Kalman  Filters 

It  is  instructive  to  compare  and  contrast  fixed-coefficient  and  Kalman  filters  for  tracking 
applications.  Both  types  of  filters  are  fading-memory,  predictor-corrector,  recursive  filters.  That 
is,  the  filter  uses  previous  measurements  but  after  a period  of  elapsed  time,  the  influence  of  any 
individual  measurement  becomes  less  and  less  significant.  The  measurement  fades  from  the  filter 
memory.  The  filter  predicts  a position  and  corrects  its  prediction  process  based  on  observation.  The 
algorithm  is  recursive  in  that  it  involves  a fixed  set  of  rules  that  are  applied  repeatedly  once  each 
measurement  cycle. 

An  obvious  difference  between  the  two  is  that  fixed-coefficient  filters  employ  fixed  gains 
while  Kalman  filters  use  time-variable  gains  that  are  adaptively  varied  based  on  past  performance 
to  maximize  future  tracking  performance.  Both  techniques  provide  significant  smoothing,  but  the 
smoothing  from  the  Kalman  filter  should  usually  exceed  that  of  the  fixed-coefficient  filters. 

Fixed-coefficient  filters  have  simple  solutions  for  predicted  performance  that  can  assist 
design  analyses.  They  are  computationally  simple  and  efficient.  Thus  they  are  ideal  for  use  in  dense 
multiple-target,  high  data  rate  applications.  The  performance  of  a Kalman  filter  cannot  really  be 
predicted,  although  it  can  be  obtained  through  extensive  Monte  Carlo  simulations.  It  is  not 
uncommon  for  early  performance  estimates  to  be  performed  assuming  a fixed-coefficient  filter,  with 
full  knowledge  that  the  final  solution  will  be  a Kalman  filter  with  somewhat  better  performance. 
Kalman  filters  are  considerable  more  computationally  complex  than  fixed-coefficient  filters. 
Significant  processing  capabilities  may  be  required  for  multiple-target,  high  data  rate  applications. 

The  track  accuracy  of  fixed-coefficient  filters  degrades  rapidly  as  “missed  detections” 
become  frequent.  Kalman  filters  adjust  their  gains  to  accommodate  missed  detections  with  the  least 
possible  degradation  in  performance.  Track  accuracy  of  fixed-coefficient  filters  degrades  if 
miscorrelation  (frame-to-frame  association  of  target  detections  to  specific  tracks)  occurs.  Frequent 
miscorrelation  can  cause  track  loss.  The  Kalman  filter  model  does  not  include  miscorrelation  noise. 
Miscorrelation  can  lead  to  rapid  divergence  of  track  accuracy,  further  increased  miscorrelation,  and 
ultimate  track  loss.  High  missed  detections  favors  Kalman  filters.  High  miscorrelation  favors  fixed- 
coefficient  filters. 

The  response  time  to  adjust  to  target  maneuvers  is  coefficient  dependent,  but  predictable,  in 
fixed-coefficient  filters.  Maneuvers  will  degrade  track  accuracy.  However,  if  the  maneuver  does 
not  carry  the  target  out  of  the  tracking  window  before  the  filter  can  adjust  and  if  the  target  returns 
to  the  assumed  class  of  dynamics  after  completion  of  the  maneuver,  then  fixed-coefficient  filters  can 
follow  the  target  through  the  maneuver.  Target  maneuvers  are  not  included  in  the  Kalman  filter 
model.  Violent  maneuvers  may  produce  miscorrelation  and  ultimate  loss  of  track.  If  this  does  not 
occur,  the  Kalman  filter  will  also  follow  the  target  through  the  maneuver.  With  either  filter  type  it 
is  good  practice  to  run  multiple  filters  with  different  dynamics  models  that  can  capture  likely 
maneuvers.  This  is  less  of  a penalty  in  fixed-coefficient  filters  than  in  Kalman  filters. 
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Multi-Sensor,  Multi-Platform  Tracking 

Many  sensor  systems  are  capable  of  making  measurements  of  target  angular  position.  This 
is  useful  in  itself,  but  often  more  (the  target  range)  is  desired.  A commonplace  suggestion  is  to  use 
two  (or  more)  sensors  and  triangulate  to  find  the  target  range.  However,  this  is  easier  said  than 
done.  Consider  Figure  15-9.  In  the  first  instance  there  are  two  sensors  and  three  targets.  Since  only 
angle  measurements  are  available,  the  three  lines  of  sight  from  each  of  the  two  sensors  give  nine 
possible  target  positions  (positions  where  two  lines  of  sight  cross)  of  which  only  three  are  real.  The 
other  positions  are  “ghost”  targets.  Not  all  combinations  of  positions  are  possible,  but  every  ghost 
position  is  viable  in  one  potential  target  set  or  another.  The  most  common  suggestion  for 
“deghosting”  is  to  add  a third  sensor  (second  instance  in  Figure  15-9).  Now  valid  target  positions 
require  all  three  lines  of  site  to  occur  at  a point  (so-called  “triple  crossings”).  This  would  seem  to 
be  an  acceptable  technique,  however,  we  have  neglected  the  potential  for  sensor  errors.  The  third 
instance  in  Figure  1 5-9  shows  the  effects  of  random  errors  in  the  measured  target  angles.  The  triple 
crossings  disappear.  One  is  tempted  to  assume  that  if  three  lines  intersect  forming  a triangular 
region  that  is  small  enough,  then  this  is  the  equivalent  of  a triple  crossing  point.  However,  some 
uncertainty  would  remain.  In  the  figure  it  appears  that  triangular  regions  near  the  true  target  points 
exist,  but  there  are  several  others  just  as  small  that  would  give  gross  errors.  In  the  fourth  instance, 


Figure  15-9.  Problems  with  using  triangulation  of  angle-only  measurements  to  estimate  range. 
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the  sensors  are  assumed  to  have  position  measurement  errors.  When  triangulation  is  attempted,  the 
wrong  baselines  will  be  assumed.  This  also  breaks  the  triple  crossing  degeneracies.  If  small 
triangular  regions  are  assumed  to  indicate  targets,  then  many  ghosts  appear.  The  combination  of 
angular  errors  and  position  errors  makes  the  problem  even  harder.  Addition  of  still  more  sensors 
will  ultimately  solve  the  problem,  but  use  of  only  two  or  three  may  not. 

The  magnitude  of  the  effects  of  measurement  errors  can  be  determined  from  simple 
geometric  analysis.  Consider  the  first  instance  in  Figure  15-10.  This  is  a system  in  which  both 
sensors  make  perfect  angular  measurements  but  the  sensors  make  a small  error  5L  in  measuring  their 
separation  Lg.  If  Rg  is  the  true  range  to  the  target,  the  error  in  estimating  range  is  easily  determined 
to  be 

SR  - SL[Rq  / Lq)  - SL coi0Q  (15.58) 

where  6g  is  the  angular  difference  between  the  sensor  angle  measurements.  If  we  pick  reasonable 


Figure  15-10.  Geometry  for  evaluation  of  target  position  errors  given  errors 
in  sensor  position  measurements  and  in  sensor  angle  measurements. 
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values  for  the  parameters,  such  as,  Lq=\  km,  Rq=  100  km,  and  <SL  = 10  m,  then  we  find  ^ = 1 km. 
This  is  not  too  bad  for  many  applications  (target  localization,  targeting  for  carpet  bombing,  or  cuing 
a weapon  with  a terminal  seeker),  but  inadequate  for  others  (indirect  fire  artillery). 

Now  consider  the  second  instance  in  the  figure.  In  this  case,  the  baseline  is  assumed  to  be 
accurately  known  but  the  sensor  angles  are  assumed  to  have  a small  relative  angular  error  <f>.  From 
the  given  geometry  it  is  easy  to  determine  that 


5R  - Lq  cot(^Q  -(f)-  Rq 


1 + tan  6q  tan  (j) 
tan  6q  - tan  (j) 


(15.59) 


^ i?o  + Zotan^  ^ 
^ Lq-  Rq  tan  (j)  * 


Now  assuming  the  same  values  as  before,  i.e.,  Lg=\  km  and  = 100  km,  and  a modest  value  of  the 
angular  error,  ^=\  mrad,  we  calculate  that  = 1 1 . 1 km.  This  is  too  large  for  almost  any  purpose. 

Some  angle  sensors  (such  as  an  infrared  search  and  track  system)  can  make  angle  measure- 
ments better  than  1 mrad,  although  many  others  (radio  frequency  interferometers,  radars  in  which 
the  range  function  is  being  jammed)  cannot.  Let  us  consider  a very  realistic  system.  The  Global 
Positioning  System  can  provide  position  accuracy  to  roughly  10  m for  military  systems.  Thus,  any 
two-platform  system  should  know  its  position  well  enough  to  get  range  to  1%  accuracy,  if  angular 
errors  were  not  present.  Let  us  do  an  error  budget  for  a system  attempting  to  obtain  1 mrad  overall 
angular  accuracy  (giving  approximately  1%  range  error).  First,  each  platform  must  determine  the 
direction  of  true  north  relative  to  the  platform’s  reference  system  (in  order  to  communicate  the 
angles  to  each  other,  a universal  coordinate  system  must  be  used).  There  will  be  some  error  in  this 
determination.  Next,  the  sensor  making  the  target  angle  measurements  must  be  aligned  with  to 
respect  to  the  platform  reference  system.  There  will  be  a significant  error  in  this  (most  combat 
platforms  are  flexible  and  not  perfectly  elastic  - they  can  warp  significant  amounts  in  the  course  of 
combat  operations).  Lastly,  the  sensor  must  estimate  the  position  of  the  target  with  respect  to  its 
internal  references.  All  together  we  have  at  least  6 angular  error  sources  (3  from  each  platform). 
Assuming  all  are  independent  and  that  all  are  roughly  equal  (the  optimistic  assumption),  then  the 
allowable  error  per  source  is  l/Vh  = 0.41  of  the  total  allowable  error.  An  0.41  mrad  error  is  difficult 
to  achieve  in  any  of  the  error  sources  let  alone  all  of  them.  It  must  also  be  considered  that  even  1% 
range  error  is  not  adequate  for  missions  such  as  artillery  support.  Targeting  of  sites  to  GPS 
accuracies  to  permit  GPS  guided  weapons  to  be  employed  will  require  0.1%  errors  at  10  km  range 
decreasing  to  0.001%  at  1000  km  range. 
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Angle-only  tracking  performance  can  be  improved  in  several  ways.  The  first  is  to  use  a 
larger  baseline.  As  Lg  increases,  the  rms  error  SR  decreases.  The  ideal  geometry  for  triangulation 
is  to  have  the  baseline  comparable  to  the  range.  In  this  ideal  geometry  the  rms  error  is  approxi- 
mately the  rms  angular  error  times  the  range.  If  we  use  a 1 00  km  baseline  in  our  example,  we  might 
achieve  something  of  the  order  of  5R/R  = 0.001  rather  than  the  SR/R  = 0.11  that  was  achieved. 
Unfortunately,  there  are  many  situations  where  the  ideal  geometry  cannot  be  achieved.  The  situation 
can  also  be  ameliorated  by  introducing  multiple  sensors  and  averaging  over  many  measurements  to 
reduce  the  contributions  of  errors.  The  rms  range  error  will  decrease  as  the  square  root  of  the 
number  of  independent  measurements  made  (whether  made  from  different  locations  by  the  same 
sensor  or  by  other  sensors).  However,  if  sensor  bias  error  is  the  dominant  error  source,  then  multiple 
measurements  will  not  improve  the  accuracy,  and  multiple  sensors  will  only  improve  the  error  by 
the  square  root  of  the  number  of  sensors  employed.  As  a consequence,  it  still  often  remains  much 
easier  to  talk  about  using  triangulation  between  multiple  platforms  than  it  is  to  actually  implement 
it  in  a real  system. 
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Target  Motion  Analysis 

The  preceding  discussion  of  tracking  and  range  estimation  of  targets  from  multiple  platforms 
often  elicits  the  question  as  to  whether  a target  can  be  tracked  using  measurements  made  at  different 
positions  (and  obviously  different  times).  The  answer  to  this  question  is  a qualified  yes  and  requires 
a discussion  of  the  topic  of  target  motion  analysis  (TMA).[7],  [8]  TMA  involves  the  study  of 
angular  bearings  to  a target  over  time  taking  account  of  the  motion  of  the  observing  platform. 

Consider  an  observing  platform  (denoted  by  O’s)  moving  in  a straight  line  at  a constant 
velocity  and  a target  (denoted  by  X’s)  also  moving  in  a straight  line  at  a constant  velocity.  Five 
bearing  observations  made  at  equal  time  intervals  are  shown  in  Figure  15-11.  For  the  constant 
velocity  target  and  platform,  there  is  a smooth  change  in  relative  bearing  over  time.  However,  the 
angle  only  information  is  insufficient  to  determine  the  target  range.  A second  target  at  a different 
range  moving  at  a different  constant  velocity  is  shown  that  produces  the  exact  same  bearing 
measurements  as  the  first  target.  In  fact,  there  are  an  infinite  number  of  possible  constant- velocity 
paths  that  produce  the  same  bearing  measurements.  There  is  one  possible  solution  for  each  possible 
value  of  the  initial  target  range.  Distant  targets  must  move  faster  and  have  a more  acute  closure 
angle  than  nearby  targets,  but  they  are  all  mathematically  acceptable.  If  there  is  no  other  informa- 
tion available,  such  as  an  initial  range,  or  a known  velocity,  there  is  no  way  to  discriminate  between 
solutions  in  the  scenario  presented  in  Figure  15-11. 


Figure  15-11.  Many  possible  target  positions  and  velocities  satisfy  any  set  of  bearings 
measured  from  a platform  in  straight  line  motion. 
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A solution  to  the  problem  can  be  obtained  if  one  changes  the  scenario.  In  Figure  15-12,  the 
target  still  continues  to  have  uniform  straight  line  motion,  but  the  observing  platform  makes  a 
maneuver  midway  through  the  observations.  Before  the  maneuver,  the  bearing  rates  are  consistent 
with  many  different  target  motions.  After  the  maneuver,  the  new  bearing  rates  are  also  consistent 
with  many  different  target  motions.  However,  the  consistent  solutions  after  the  maneuver  are  not 
the  same  consistent  solutions  as  before  the  maneuver.  In  fact,  there  is  only  one  range  and  velocity 
that  gives  consistent  bearing  observations  both  before  and  after  the  platform  maneuver.  In  Figure 
15-12,  postulated  target  motion  1 is  consistent  both  before  and  after  (target  1 is  the  actually  target 
track).  Postulated  target  motion  2 is  consistent  only  before  the  maneuver.  After  the  maneuver,  that 
motion  requires  bearing  observations  that  no  longer  agree  with  the  true  observations.  Target  track 
2 can  be  eliminated. 

A geometry  useful  in  analyzing  TMA  is  shown  in  Figure  15-13.  The  observation  platform 
makes  two  bearing  measurements  a,  and  a2  separated  in  distance  by  VqT  where  Vq  is  the  known 
velocity  of  the  platform  and  T is  the  known  time  between  measurements.  The  target  is  initially 
located  at  an  unknown  range  R,  traveling  at  unknown  speed  Vj  at  an  unknown  angle  /relative  to  the 
bearing  direction.  Assume  that  the  target  range  and  speed  are  known.  These  are  probably  nothing 
but  a guess,  but  we  can  assign  values  to  the  parameters,  whether  or  not  the  parameters  are  correct. 
At  speed  Vj  the  target  can  move  a distance  VjT  in  the  time  between  measurements. 


Figure  15-12.  Bearings  measurements  made  along  two  linear  segments  resolve  ambiguities. 
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Figure  15-13.  Geometry  used  in  target  motion  analysis. 


One  side  of  the  quadrilateral  defined  by  the  target  and  platform  positions  at  the  two 
measurement  times  is  known  precisely  as  are  two  of  the  four  angles.  Two  more  sides  are  assumed 
to  be  known.  This  information  is  sufficient  to  determine  all  of  the  other  quadrilateral  characteristics. 
Specifically,  we  are  interest  in  determining  what  angles  /satisfy  the  observations. 

It  is  convenient  to  determine  the  length  I of  the  diagonal  connecting  the  initial  target  position 
to  the  final  platform  position.  Using  the  law  of  cosines  this  is  given  by 

- RI  + ~ cos«j  (15.60) 


Similarly,  the  angle  fig  made  between  this  diagonal  and  the  platform  motion  side  of  the  quadrilateral 
can  also  be  found  using  the  law  of  cosines 

=l\+{VoT)^  -21, VoT cos (15.61) 
Solving  for  the  angle  yields 


Pq  = COS 


-1 


2i,VJ 


J 


(15.62) 


or 
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y^O  = COS  ' 


VqT-  coscifj 


VL 


Rl  + {Vjf  -IR.VoTcosa, 


nl/2 


(15.63) 


The  third  angle  of  this  triangle  yg  could  be  found  in  the  same  way,  i.e., 


Xo  = cos 


-1 


^ Rl  + e]-{v„T)"'^ 


and 


V 

f 


2i,R, 


ro  = cos 


-1 


-V(jT cosctj 


Rf  +{VoTf  -2R,VoTcosa, 


1/2 


(15.64) 


(15.65) 


or  it  could  be  determined  from  the  fact  that  triangles  have  angles  that  must  sum  to  n, 

7i-ay-  (15.66) 

Given  a solution  for  Pg,  the  other  triangle  defined  by  the  diagonal  can  be  addressed.  From 
the  law  of  sines  we  have 


V^T 


A 


sin(A-y^o)  sin^i  ’ 


(15.67) 


which  can  be  rearranged  to  yield 


sin^j 


fiSin(;^i-;^o) 


V^T 


(15.68) 


and  then  solved  to  yield  the  third  angle  Sj  of  the  quadrilateral 


5^  = sin 


-1 


A sin(A->^o) 


V 


VjT 


(15.69) 


The  desired  angle  y,  is  found  from  the  requirement  that  the  four  angles  of  a quadrilateral 
must  sum  to  2ti.  Thus, 
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;Ki  = ln  - a^-  f5^-  5^  = n -v  - a^  - 


(15.70) 


Substituting  in  the  earlier  expressions  for  and  ^ yields 


Y,=lK-a,-p, 

9 / \2 

Rl+(VJ)  -IR.VJco^a, 


-1 


sin 


ni/2 


f 

sin 

VV  V 


- COS  ' 


VqT-R^  cos<2j 

Rl  +(Vjf  -IRyjcosa, 


1/2 


Eq.  (15.71)  Contains  only  measured  or  assumed  quantities.  For  an  assumed  range  and  speed,  there 
are  at  most  two  mathematical  solutions  for  y,.  If  we  make  a third  bearing  measurement,  and  repeat 
this  analysis,  one  of  the  original  two  solutions  will  be  found  to  inconsistent  with  the  angles  and 
ranges  associated  with  the  second  computation. 

With  estimates  for  both  y,  and  Yq  in  hand,  it  is  possible  to  predict  the  value  of  target  range 
R2  at  the  time  of  the  second  bearing  measurement.  Using  the  law  of  cosines  again  we  have 

RI  = +{VjT)^  cos(/i  -Yo)-  (15.72) 

Substitution  of  the  expressions  for  y,  y^,  and  into  this  expression  (to  produce  an  expression 
dependent  only  on  measured  or  assumed  values)  is  left  as  an  exercise.  The  predicted  value  of  7?^  can 
now  be  used  as  the  starting  point  for  analyzing  the  next  bearing  measurement.  It  should  also  be 
noted  that  (the  value  of  y,  that  should  result  from  analyzing  the  next  pair  of  bearing  measurements 
can  also  be  estimated  from  the  value  of  Sj,  i.e., 

y 2=  71-5^.  (15.73) 

The  difference  between  the  value  of  predicted  from  the  previous  measurement  analysis  and  the 
value  calculated  from  the  succeeding  measurement  analysis  is  a residual  that  can  be  used  to 
determine  how  significantly  measurement  errors  are  affecting  the  “track”. 

Three  bearing  measurements  can  be  used  to  uniquely  identify  the  target  trajectory  IF  the 
range  and  speed  are  assumed  to  be  known  (and  if  the  measurements  are  sufficiently  accurate).  If 
the  bearing  measurements  have  errors,  then  for  each  pair  of  measurements  (and  for  each  assumed 
set  of  range  and  speed  values)  the  predicted  motion  angles  (y,)  will  be  different.  Only  by  making 
a large  number  of  measurements  and  smoothing  with  a filter  will  a “precise”  angle  be  determined. 
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In  practice,  we  do  not  know  either  the  range  or  the  speed.  However,  the  speed  is  almost 
always  limited  to  a moderately  narrow  range  of  values.  Under  normal  conditions  both  ships  and 
aircraft  tend  to  move  with  speeds  between  the  speed  that  gives  them  optimum  fuel  consumption  (for 
maximum  range)  and  their  maximum  sustained  speed.  Submarines  hunting  for  targets  tend  to  move 
at  speed  slightly  below  the  maximum  speed  that  gives  them  reasonably  quiet  signatures.  These 
speed  ranges  are  usually  approximately  known  in  advance.  If  the  target  type  can  be  estimated,  then 
the  speed  can  be  estimated  (to  perhaps  a factor  of  two  accuracy  or  possibly  even  better).  Range 
might  also  be  estimated.  At  initial  detection  of  a target,  it  is  reasonable  to  assume  that  the  target  is 
near  the  maximum  detection  range  of  the  sensor.  This  range  clearly  depends  on  the  target  signal 
strength  (“small”  targets  cannot  be  detected  as  far  away  as  “large”  targets).  However,  if  the  target 
type  can  be  guessed,  then  the  detection  range  can  be  estimated  (often  to  better  than  factor  of  two 
accuracy).  Other  information  may  be  available  that  can  aid  in  refining  these  estimates.  Sonar  or 
passive  rf  signatures  may  identify  the  target  type.  They  may  even  provide  information  on  possible 
target  motion.  For,  example,  Doppler  shift  in  an  IFF  (identification  friend  or  foe)  transponder  return 
(some  classes  of  IFF  emitters  tend  to  have  relatively  precise  emission  frequencies)  or  blade  rate  data 
from  a sonar  might  give  an  improved  estimate  of  velocity. 

There  is  no  single  well-defined  algorithm  for  performing  a target  motion  analysis.  This  is 
an  area  of  active  research  interest.  We  will  assume  that  the  bearings  to  the  target  have  been 
measured  over  two  extended  straight  line  segments  that  are  not  collinear.  One  algorithmic  approach 
(not  necessarily  the  best  nor  the  one  used  in  most  systems)  to  determining  the  real  target  position 
is  to  assume  a target  speed,  estimate  the  motion  angles  (using  the  first  set  of  straight  line  measure- 
ments) for  several  possible  initial  ranges.  The  process  is  repeated  for  the  second  set  of  straight  line 
measurements.  One  range  will  give  less  angular  error  between  the  motion  in  the  first  set  and  the 
motion  in  the  second  set  than  will  the  other  ranges.  If  these  errors  are  monotonically  decreasing 
as  one  end  of  the  spread  of  values  is  approached,  then  additional  values  should  be  included  until  a 
minimum  is  obtained.  Next  this  minimum-angular-error  range  is  assumed  and  the  motions  for  each 
segment  are  estimated  for  several  velocities  around  the  velocity  originally  selected.  One  velocity 
will  yield  less  angular  error  than  the  others.  Using  this  new  velocity,  the  process  is  repeated 
assuming  several  range  values  (usually  closer  together  than  the  original  set  of  range  values). 
Iterating  back  and  forth  between  fixing  the  range  and  velocity  should  ultimately  result  in  conver- 
gence to  a single  range  and  velocity  that  yields  minimum  error  in  the  motion  angles  before  and  after 
the  break  in  straight  line  measurements.  NOTE:  in  the  presence  of  unavoidable  measurement  errors, 
convergence  means  that  the  error  will  reduce  to  a small  value  after  which  it  will  not  continue  to  get 
smaller.  The  iterations  should  be  halted  as  soon  as  the  error  stops  shrinking.  It  is  not  possible  to 
obtain  a zero-error  solution  when  the  initial  measurements  contain  errors. 

There  are  several  techniques  for  estimating  the  target  range  from  a very  limited  number  of 
measurements.  The  estimates  are  not  as  accurate  as  the  lengthy  recursive  analysis  of  an  extended 
series  of  measurements,  but  they  can  be  performed  quickly  and  simply.  One  of  these  techniques  is 
called  Ekelund  ranging.  Consider  the  geometry  shown  in  Figure  15-14.  If  we  define  the  quantities 
Uq  and  Uj  as  the  component  of  velocity  (of  the  platform  and  target,  respectively)  perpendicular  to 
the  measured  bearing,  then  from  two  bearing  measurements  made  during  straight  line  motion  we 
find 
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Figure  15-14.  The  basics  of  geometry  for  analyzing  Ekelund  ranging. 


sin(«2  ~^i)  - (15.74) 

where  T is  the  time  between  the  two  bearing  measurements  and  the  transverse  velocity 

components  are  related  to  the  total  velocity  components  by 

t/o=FoSincti  (15.75) 

and 


Uj  = Vj.sin/^. 


Dividing  both  sides  of  Eq.  (15.74)  by  the  time  difference  T yields 

T?2  sin(ct2  ~^i) 


= Uo-U,. 


(15.76) 
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Now  let  the  time  difference  become  small,  i.e.,  T— >0.  The  difference  between  bearings  will  become 
small,  so  that  sin  x~x.  The  small  bearing  difference  divided  by  a small  time  difference  approxi- 
mates the  derivative  of  the  bearing  with  respect  to  time.  The  range  will  not  change  appreciably  over 
the  measurement  interval  so  we  have  ~ ~ R-  Thus,  Eq.  (15.76)  can  be  rewritten  as 


lim' 

r-^-o 


sin(ct2  - i?(«2-«i)  7?Aa 


R 


da 

dt 


^Uo-Ur  (15.77) 


The  range  times  the  bearing  rate  is  equal  to  the  difference  in  transverse  velocity  components. 

Now  consider  bearing  rate  measurements  immediately  before  and  after  a change  in  platform 
direction.  Immediately  before  the  direction  change  we  have 

da 

R ^Uo-U^.  (15.78) 

dt 


Immediately  after  the  direction  change,  the  range  remains  the  same,  the  target  transverse  velocity 
remains  the  same,  but  the  bearing  rate  and  platform  transverse  velocity  will  be  different.  We  have 

R^  = U'o-Uj.  (15.79) 

dt 


Eqs.  (15.78)  and  (15.79)  can  be  solved  to  yield  the  range  R 


or 


and  finally 


R^-R^  = {Uo-U,)-(U'o-U,) 


R 


da  da 


'\ 


dt  dt 


-Uo-U'o 


R=  ^o-U'o 


da  da' 


dt  dt 


(15.80) 

(15.81) 


(15.82) 


Equation  (15.82)  is  the  Ekelund  estimate  of  target  range.  It  should  be  noted  that  this  is  estimate  can 
have  substantial  errors  because  it  has  the  difference  of  two  small  inaccurate  quantities  in  the 
denominator.  A bearing  can  be  estimated  with  a certain  limited  precision.  A bearing  rate  adds  the 
uncertainties  in  the  bearing  measurements.  A difference  in  bearing  rate  adds  the  uncertainties  in  two 
bearing  rate  measurements.  This  cascading  of  uncertainties  leads  to  substantial  error  potential  in 
the  estimate  of  range.  If  the  time  is  allowed  to  grow  large  enough  to  reduce  errors  in  bearing  rate, 


486 


then  errors  associated  with  range  changes  and  the  approximation  of  the  sine  function  will  become 
significant.  Range  errors  in  excess  of  10%  are  commonplace  and  may  be  much  larger.  Neverthe- 
less, the  Ekelund  range  is  better  than  no  range  estimate  at  all  and  may  often  be  sufficient. 

Even  without  being  able  to  perform  the  maneuver  (or  before  initiating  that  maneuver)  it  is 
possible  to  obtain  a limited  amount  of  information  from  TMA.  Figure  15-15  shows  the  true  motion 
and  the  observed  (relative)  motion  for  four  possible  situations.  In  Figure  15- 15a,  the  target  is 
overtaking  the  platform.  The  bearing  rate  is  positive  and  monotonically  increasing  as  long  as  the 
range  is  decreasing.  Once  minimum  range  has  been  passed,  the  bearing  rate  will  continue  to  be 
positive  but  it  will  begin  to  monotonically  decrease.  In  Figure  1 5- 1 5b,  the  target  is  being  overtaken 
by  the  platform.  The  bearing  rate  is  negative  and  monotonically  increases  until  minimum  separation 
occurs,  after  which  it  remains  negative  but  monotonically  decreases.  Figure  15- 15c  show  a target 
that  is  passing  the  platform.  The  bearing  rate  is  negative  and  behaves  exactly  like  the  overtaken 
target  case.  However,  the  bearing  rates  tend  to  be  noticeably  higher  than  in  the  overtaken  target. 
Nevertheless,  it  may  be  possible  to  confuse  the  two  situations. 

One  case,  however,  is  unequivocal.  Figure  1 5- 1 5d  shows  a target  on  a collision  course  with 
the  platform.  The  bearing  rate  is  zero.  Zero  bearing  rate  is  a red  flag  for  any  tracking  situation.  The 
only  straight-line,  constant- velocity  motions  that  produce  zero  bearing  rate  are  collision  courses  or 
courses  that  were  collision  courses  in  the  past.  The  latter  results  if  the  target  originated  from  the 
platform  or  had  a near  miss  with  the  platform.  For  example,  a missile  launched  from  the  platform 
or  a target  that  has  overflown  the  platform  will  be  outbound  and  exhibit  zero  bearing  rate.  It  is  rare 
that  information  allowing  discrimination  of  inbound  collision  courses  from  outbound  near-misses 
is  completely  absent. 

A skilled  observer  can  use  the  bearing  rate  data  to  estimate  the  target  motion.  However,  it 
is  possible  for  even  an  unskilled  observer  to  calculate  the  direction  of  relative  motion.  This  quantity 
^is  the  target  motion  direction  relative  to  the  bearing  of  the  platform  motion.  The  geometry  used 
to  analyze  this  problem  is  shown  in  Figure  15-16.  Applying  the  law  of  sines  to  the  triangle  forme 
from  the  first  and  second  bearings  yields 


sin(;r  - ct j - ^)  sin(ctj  -I-  sin(ctj  - <^2 ) 

h h 


sin(ctj  -«2) 
^(^2-0 


(15.83) 


Applying  the  law  of  sines  to  the  triangle  formed  from  the  second  and  third  bearings  yields 

sin(:^-a^  sin(ci!'3 -I- sin(ci!'2  ~ ^3) 


h 


h 


F(?3  ^2) 


(15.84) 
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Figure  15-15.  Relative  motions  observed  in  different  target  motion  situations. 


a)  overtaking  target 


b)  overtaken  target 


c)  passing  target 


TRUE  RELATIVE 


TRUE 


RELATIVE 


TRUE  RELATIVE 
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Figure  15-15  (continued).  Relative  motions  observed  in  different  target  motion  situations. 


Figure  15-16.  Geometry  used  to  determine  the  direction  of  relative  motion. 


TRUE  MOTION  RELATIVE  MOTION 
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Solving  both  Equations  (15.83)  and  (15.84)  for  h/V and  equating  the  results  yields 


h _(?2-^i)sin(ai+^)_(?3-?2)sin(a3+^) 

TV  ' ( \ ' ( \ (15.85) 

V sin(<2j-<22j  sin(ci'2 -<23) 

A little  mathematical  legerdemain  allows  us  to  obtain  a more  useful  expression.  That  is,  adding  and 
subtracting  a,  from  the  arguments  of  both  numerator  and  denominator  on  the  right-hand  side  and 
regrouping  the  terms  and  then  expanding  the  two  sin[ (a)-(b)]  terms  with  four  angles  in  them  yields 

(^2  - h ) + ^)  _ (h  - h ) sin((o^i  + ^)  ~ («i  ~ «3 )) 

sin(ci' j - a 2)  sin((ci' j - <23 ) - (ct j - ct2 )) 


(^3  “ ^2 )(sin(ct j + cos(ct j - a 2)-  cos(ct j -f  j - <23 )) 

sin(«j  -«3)cos(«i  -«2)“COs(«j  -«3)sin(ctj  - a 2) 


With  a little  more  algebra  we  obtain 


(15.86) 


1 


1 


(15.87) 


which  can  be  rewritten  in  terms  of  cotangents  as 

(?2-?i)(cot(«i-a2)) 

(^3  “ ^2 )(cot(<2 j - a 2)-  cot(ct j + ^)) 


Rearranging  terms 


(15.88) 
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{h  -^i)(cot(o!'j  -a2)-cot[a^  “<^3)) 

= (^3  -^2)(cot(ci'j  -<23)-cot(ci'j  +C)) 

and  simplifying  yields 

(?2  -?i)cot(a,  -0C2) 

= (?3  - ?j)cot(<2j  - ^^3)-  (^3  - ?2)cot(<2j  +4^) 
This  may  finally  be  solved  to  yield  the  direction  of  relative  motion. 


(15.89) 


(15.90) 


4'  = -«1  +cot  ^ 


(?3  -?j)c0t(<2j  -<T3)-(f2  -^i)c0t(<2j  -OC2) 

{h  ~h) 


(15.91) 


There  is  a final  special  case  of  TMA  that  can  be  useful.  If  the  target  is  so  distant  that  it 
cannot  move  a significant  fraction  of  the  range  between  the  target  and  the  observing  platform,  then 
the  target  may  be  assumed  to  be  stationary.  This  is  not  an  uncommon  situation.  For  example,  a jet 
aircraft  at  a range  of  1 00  nmi  can  only  move  5 nmi  in  any  direction  in  the  time  it  takes  the  observing 
platform  to  establish  a 5 nmi  triangulation  baseline.  A submarine  detected  in  a convergence  zone 
can  only  move  1 or  2 nmi  in  the  time  it  takes  for  a destroyer  or  a helicopter  to  establish  a 1-2  nmi 
baseline.  The  motion  will  result  in  errors  but  these  errors  are  typically  not  significantly  larger  than 
the  errors  produced  by  triangulation  using  measurements  with  typical  measurement  errors.  Figure 
15-17  shows  the  geometry  for  analyzing  passive  ranging  as  single-platform  triangulation  of  distant 
targets  is  called. 

Applying  the  law  of  sines  to  the  triangle  shown  in  Figure  15-17  yields 


R d d d 

sinctj  sin/  sm{7r-a^-  sin(aj-i-y^) 

Rearranging  terms  in  this  equation  yields  the  result 

^ dsina^  dsina^  VTsina^ 
sin(a:j sin(a2“t^i)  sin(a2“Ctj) 


(15.92) 


(15.93) 
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Figure  15-17.  Geometry  for  passive  ranging  of  distant  targets  using  triangulation. 


h — d — ^ 
= VT 


where  d is  the  baseline  distance,  V is  the  platform  velocity  (not  the  target  velocity),  T is  the  time 
used  to  synthesize  the  triangulation  baseline,  and  a,  and  a2  are  the  bearing  angles  relative  to  the 
platform  motion.  The  effects  of  measurement  errors  can  be  estimated  using  the  same  relations 
developed  for  multi-platform  tracking  (Eqs.  (15.58)  and  (15.59)).  In  the  earlier  section,  these  errors 
were  estimated  to  be  in  the  range  of  1-10%  of  the  target  range.  This  is  comparable  to  the  error  that 
would  result  if  the  target  could  move  1-10%  of  the  target  range. 
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Problems 


15-1.  Describe  how  monopulse  tracking  operates?  In  what  types  of  target  scenarios  is  monopulse 
tracking  going  to  work  best? 

15-2.  Why  are  conical  scan,  reticle,  sequential  lobing,  etc.  poor  extended  target  trackers? 

15-3.  Describe  the  basic  functioning  and  utility  of  a tracking  filter. 

15-4.  In  any  tracking  filters,  what  penalties  do  you  pay  for  increased  track  smoothing? 

15-5.  You  have  the  problem  of  tracking  flies  (one  at  a time  — a fly  is  nominally  1 cm  in  diameter 
and  moves  at  speeds  up  to  5 m/s)  at  ranges  up  to  100  m.  What  techniques  might  be 
employed  if 0.0000 1 rad  tracking  precision  is  required.  If  a fly  can  complete  a maneuver  in 
I msec  and  the  tracking  sensor  is  capable  of  only  0.0001  rad  measurement  precision,  what 
filter  gains  and  measurement  intervals  should  be  used  to  avoid  serious  track  degradation. 

15-6.  What  are  the  differences  between  the  two  types  of  fixed  coefficient  filters? 

15-7.  Compare  the  signals  that  a conical  scan  radar  and  a spin  scan  AM  reticle  seeker  would  see 
for  targets  at  the  center  of  the  scan  and  for  targets  near  the  edge  of  the  scan.  A tracking 
system  can  employ  either  a-P  or  a-P-y  filters.  For  a=0.2  determine  the  optimum  p and  y 
coefficients.  Determine  the  response  time  and  the  hangoff  errors  for  each.  What  condition 
would  cause  one  or  the  other  of  these  filters  to  be  much  more  desirable  than  the  other? 

15-8.  What  simple  output  signal  characteristic  is  analyzed  to  determine  the  tracking  error  signals 
in  all  of  the  following:  conical  scan,  sequential  lobing,  and  some  spin-scan  reticle  tracking 
approaches? 

15-9.  What  broad  class  of  single  target  trackers  works  well  against  extended  (resolved)  targets? 

15-10.  Describe  the  track-while-scan  tracking  process. 

15-11.  You  wish  to  track  a target  of  50  m length  that  has  its  predominant  signature  in  the  infrared. 
If  the  target  range  is  roughly  1 m,  what  tracking  options  would  you  consider?  If  the  target 
range  is  roughly  50  km,  what  tracking  options  would  you  consider?  What  tracking  options 
would  you  consider  in  each  case  if  several  targets  needed  to  be  tracked  simultaneously? 

15-12.  You  wish  to  track  by  using  triangulation.  Your  platforms  have  differential  GPS,  so  the 
baseline  accuracy  can  be  as  small  as  0.1  m.  If  the  baseline  is  I km  and  the  target  range  is 
roughly  100  km,  what  important  error  sources  remain?  Estimate  the  magnitude  of  those 
errors  and  the  magnitude  of  the  resulting  range  estimation  error. 
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15-13.  Derive  an  expression  for  in  F igure  1 5- 1 3 in  terms  of  only  the  assumed  Rj  and  V^T  and  the 
measured  a,,  /3„  and  VqT. 

15-14.  A ship  is  making  10  knots.  It  begins  a set  of  bearing  measurements  on  an  unknown  target 
at  time  0.  After  5 minutes  it  makes  a 90  degree  turn  to  starboard  and  continues  the 
measurements.  The  relative  bearing  measurements  (direction  of  bow  equals  0 degrees)  are 
TIME  BEARING 

(minutes)  (degrees) 

0 60.0 

4 58.9 

6 355.3 

10  340.3 

Estimate  the  range  to  the  target  at  the  turn. 

15-15.  A target  is  assumed  to  be  distant  relative  to  a platform  moving  with  a velocity  of  300  m/s. 
The  platform  observes  the  target  at  a relative  bearing  of  30°  and  then  observes  the  target  10 
seconds  later  at  a bearing  of  31°.  Roughly  how  far  away  is  the  target? 
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APPENDIX  A 


UNITS,  PHYSICAL  CONSTANTS,  AND  USEFUL  CONVERSION 
FACTORS 


Units 


This  book  conforms  to  the  use  of  the  International  System  of  Units  (SI)  wherever  praetieal 
[1],  This  metric  system  is  based  on  seven  base  units  and  two  supplementary  angular  units  as  shown 
in  Table  A-1. 


Table  A-1.  SI  base  and  supplementary  units. 


QUANTITY 

NAME 

SYMBOL 

length 

meter 

m 

mass 

kilogram 

kg 

time 

second 

s 

electric  current 

ampere 

A 

thermodynamic  temperature 

kelvin 

K 

amount  of  substance 

mole 

mol 

luminous  intensity 

candela 

cd 

plane  angle 

radian 

rad 

solid  angle 

steradian 

sr 

Many  additional  units  are  eommonly  aeeepted  for  use  with  SI.  These  are  shown  in  Table  A-2. 


Table  A-2.  Units  in  use  with  SI. 


QUANTITY 

NAME 

SYMBOL 

DEFINITION 

time 

minute 

min 

1 min  = 60  s 

hour 

h 

1 h = 3600  s 

day 

d 

1 d = 86400  s 

plane  angle 

degree 

o 

1°  = (ti/180)  rad 

minute 

f 

V = (1/60)° 

second 

ff 

1"  = (1/60)' 

volume 

liter 

L 

1 L = 0.001  m^ 

mass 

metric  ton,  tonne 

t 

1 t = 1 000  kg 
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From  the  base  units  in  Tables  A- 1 and  A-2  are  derived  all  other  units  of  the  SI  system.  Some 
of  the  derived  units  have  specially-named  non-SI  counterparts  or  are  used  so  often  in  certain 
applications  that  they  have  been  given  special  names  and  symbols.  These  derived  units  are  listed 
in  Table  A-3. 


Table  A-3.  SI  derived  units. 


QUANTITY 

NAME 

SYMBOL 

EQUIVALENT 

frequency 

hertz 

Hz 

s’’ 

force 

newton 

N 

kg-m/s^ 

pressure,  stress 

pascal 

Pa 

N/m' 

work,  energy,  heat 

joule 

J 

N-m,  kg-mVs^ 

power 

watt 

W 

J/s 

electric  charge 

coulomb 

C 

A-s 

electric  potential,  emf 

volt 

V 

J/C,  W/A 

resistance 

ohm 

n 

V/A 

conductance 

siemens 

s 

A/V,  n ' 

inductance 

henry 

H 

Wb/A 

capacitance 

farad 

F 

C/V 

magnetic  flux 

weber 

Wb 

V-s,  N-m/A 

magnetic  flux  density 

tesla 

T 

Wb/m^,  N/A-m 

Celsius  temperature 

degree  Celsius 

°C 

K-  273.15 

luminous  flux 

lumen 

Im 

cd-sr 

illuminance 

lux 

lx 

Im/m^ 

radioactivity 

becquerel 

Bq 

s’’ 

absorbed  dose 

gray 

Gy 

m^-s'^,  J/kg 

dose  equivalent 

sievert 

Sv 

J/kg 

Several  non-SI  units  are  strongly  entrenched  in  their  respective  applications  and  have  no 
roughly  comparable  SI  counterparts.  As  a consequence  their  use  is  provisionally  accepted.  A few 
of  these  units  cannot  be  defined  precisely  in  terms  of  SI  units;  their  values  must  be  experimentally 
determined.  The  special  non-SI  units  that  are  provisionally  acceptable  are  listed  in  Table  A-4. 
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Table  A-4.  Non-SI  units  commonly  accepted  for  use. 


QUANTITY 

NAME 

SYMBOL 

EQUIVALENT 

cross  section 

barn 

b 

1 0'^®  m^ 

pressure 

bar 

bar 

10®  Pa 

radioactivity 

Curie 

Ci 

3.7  X 10’°  Bq 

exposure  to  xrays 

roentgen 

R 

2.58  X 10'^  C/kg 

absorbed  radiation 

rad 

rad 

0.01  Gy 

length 

Cngstrom 

C 

E 

O 

b 

energy 

electron  volt 

eV 

1.602176462  x 10  ” J 

mass 

atomic  mass  unit 

u 

1.66053873x  10  "^  kg 

The  prefixes  shown  in  Table  A-5  are  used  in  SI  to  create  units  larger  or  smaller  than  the  basic  ones 

Table  A-5.  SI  prefixes. 

FACTOR 

PREFIX 

SYMBOL 

FACTOR 

PREFIX 

SYMBOL 

10"^ 

yotta 

Y 

10’ 

deci 

d 

10"’ 

zetta 

Z 

10" 

centi 

c 

10’® 

exa 

E 

10® 

milli 

m 

10’® 

peta 

P 

10® 

micro 

p (often  u) 

10’" 

tera 

T 

10” 

nano 

n 

10’ 

giga 

G 

10’" 

pico 

P 

10® 

mega 

M 

10’® 

femto 

f 

10® 

kilo 

k 

10’® 

atto 

a 

10" 

hecto 

h 

10"’ 

zepto 

z 

10’ 

deka 

da 

10"^ 

yocto 

y 

Thus,  for  example,  1 kilometer  equals  10^  meters,  while  1 picosecond  equals  10  '^  seconds. 
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Non-SI  Systems  of  Units 

The  only  claim  that  the  SI  system  of  units  has  to  be  the  universal  system  of  units  is  that  the 
international  scientific  community  has  accepted  them  as  standard.  Many  other  systems  of  units 
could  just  as  easily  have  been  selected.  The  “English  system  of  units”  based  on  seconds, 
inches/feet/yards/miles,  and  ounces/pounds/tons  is  no  less  fundamental  nor  less  useful  than  the 
metric  system.  The  metric  system  claim  to  fame  is  based  almost  entirely  on  the  supposed  (and  in 
fact  almost  entirely  cultural)  superiority  of  the  decimal  system.  Other  cultures  have  used  number 
bases  other  than  10.  The  advent  of  computers  has  demonstrated  that  binary  units  might  be  even 
more  fundamental  than  decimal  units. 

Because  of  this  many  other  systems  of  units  have  been  independently  derived  by  the  founders 
of  a field  of  endeavor  and  have  continued  to  be  widely  used  until  recently.  All  of  the  original 
derivations  in  classical  physics  and  many  if  not  most  of  the  applications  occurred  before  the 
adoption  of  the  SI  system  of  units.  Thus,  anyone  attempting  to  find  information  in  a field  is  as  likely 
to  encounter  the  use  of  one  of  the  non-standard  systems  of  units  as  they  are  to  encounter  the  use  of 
SI  units.  The  first  few  sections  of  this  appendix  are  intended  as  Rosetta  Stones  with  respect  to 
systems  of  units. 

Many  of  the  systems  of  units  arose  from  the  study  of  different  aspects  of  electricity  and 
magnetism. [2]  At  least  five  major  unit  systems  arose  here:  Electrostatic  units  (esu  - a form  of  cgs 
units  - centimeter/gram/second  units),  Electromagnetic  units  (emu  - another  form  of  cgs  units), 
Gaussian  units  (the  dominant  form  of  cgs  units,  Heaviside-Lorentz  units,  and  Rationalized  MKSA 
(meter/kilogram/second/ampere  - identical  with  SI  units  as  far  as  electrical  and  magnetic  units  are 
concerned  and  is  essentially  the  direct  predecessor  of  SI  units).  The  various  unit  systems  are  defined 
in  terms  of  how  they  relate  to  fundamental  electrical  and  magnetic  quantities  and  to  Maxwell’s 
equations.  For  example,  different  units  may  have  a different  proportionality  between  the  electric 
field  (£)  and  the  charge  {q)  at  distance  (r) 


E = 


(A.1) 


They  may  have  different  proportionality  between  the  Ampere  force  per  unit  length  (dF/dP)  and  the 
currents  (/  and  /’)  in  two  long  wires  separated  by  distance  (d) 


dF 

dl 


= 2k^ 


ir 


(A.2) 


They  may  have  different  proportionality  between  electromotive  force  and  time  rate  of  change  of 
magnetic  induction  as  expressed  in  the  Maxwell  equation 


Vxf  = 


(A.3) 
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They  may  have  different  proportionality  between  magnetic  induction  (B)  and  a current  (I)  at  distance 
(r) 


B = — 

r 

(A.4) 

Using  these  proportionalities.  Maxwell’s  equations  take  the  form 

y E = A7ik^p 

(A.5) 

V X f = k^^^ 

' dt 

(A.6) 

V-5  = 0 

(A.7) 

W xB  = A7rk^aJ+ 

" k,  dt 

(A.8) 

T able  A-6  lists  the  values  of  each  of  the  four  proportionality  constants  for  the  five  maj  or  electromag- 
netic unit  systems.  Table  A-7  lists  the  units  and  the  associated  conversion  factors  to  the  equivalent 
SI  units  for  the  electrostatic,  electromagnetic,  Gaussian,  and  SI  units. 


Table  A-6.  Proportionality  constants  defining  the  different  electromagnetic  unit  systems. 


UNIT  SYSTEM 

ki 

l<2 

ks 

ks 

So 

Mo 

Electrostatic  (esu) 

1 

1/c' 

1 

1 

471 

1 

1/c- 

Electromagnetic  (emu) 

1 

1 

1 

4ti 

1/c" 

1 

Gaussian 

1 

1/c' 

1/c 

c 

4ti 

1 

1 

Heaviside-Lorentz 

1/471 

1 /471C^ 

1/c 

c 

1 

1 

1 

Rationalized  MKSA  (SI) 

1/4718o 

Po/4p 

1 

1 

1 

So 

Mo 
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The  four  proportionality  constants  are  not  completely  independent,  it  is  possible  to  show  that 


kjk^=  (A.9) 

k^  / k2k^k^  = (A.  10) 

and 

k2=H  k^  (A.ll) 

Additional  useful  equations  include  the  Lorentz  force  relation 

F / q = E + k^v  X B (A.12) 

constitutive  relations 

D — + k^P 

and 

H = -B-k^M 
Fo 

Using  the  constants  Sg,  fig,  and  k^  from  Table  A-6,  the  form  of  the  constitutive  relations  in  each 
system  of  units  can  be  determined. 

There  remain  many  important  equations  in  the  literature  that  are  not  easily  transformed  using 
the  results  above.  Table  A-8  lists  conversion  factors  for  electromagnetic  variables  in  Gaussian  and 
SI  units,  the  two  most  common  unit  systems.  To  convert  equations  from  Gaussian  to  SI  units,  the 
quantities  in  the  center  column  are  replaced  by  the  modified  quantities  in  the  right-hand  column, 
everywhere  that  they  arise. 


(A.13) 

(A.14) 
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Table  A-7.  Electromagnetic  Systems  of  Units.  In  equations  below,  c = 2.997925  x 10*  m/s. 


QUANTITY 

Electrostatic 

CGS  UNIT  SYSTEMS 
Electromaqnetic 

Gaussian 

MKS  UNITS 
SI  (MKSAl 

Length 

Centimeter 

Centimeter 

Centimeter 

Meter  (m) 

Mass 

Gram 

Gram 

Gram 

Kilogram  (kg) 

Time 

Second 

Second 

Second 

Second  (s) 

Force 

Dyne 
(10  " N) 

Dyne 
(10  " N) 

Dyne 
(10  " N) 

Newton  (N) 

Energy 

Erg 

(10-^  J) 

Erg 

(10-^  J) 

Erg 

(10-'  J) 

Joule  (J) 

Charge 

Statcoulomb 
(0.1 /c  C) 

Abcoulomb 
00  A) 

Statcoulomb 

(0.1/cC) 

Coulomb  (C) 

Current 

Statampere 

(0.1/cA) 

Abampere 
00  A) 

Statampere 

(0.1/cA) 

Ampere  (A) 

Electric  Field 
Field 

Statvolt/cm 
(c/10^  V/m) 

Abvolt/cm 
(10-^  V/m) 

Statvolt/cm 
(c/1 0^  V/m) 

Volt/meter 

Electric 

Potential 

Statvolt 
(c/10"  V) 

Abvolt 
(10  ® V) 

Statvolt 
(c/1 0®  V) 

Volt  (V) 

Electric 

Polarization 

Statcoul/cm^ 
(10V47IC  C/m') 

Abcoul/cm^ 
(^0-V4n  C/m'' 

Statcoul/cm' 
(10V4tic  C/m') 

C/m' 

Electric 

Displacement 

Statcoul/cm' 
(10V471C  C/m') 

Abcoul/cm' 

(10  V47T  C/m') 

Statcoul/cm' 
(10V471C  C/m') 

C/m' 

Resistance 

Statohm 
(c'/lO"  Q) 

Abohm 

(10’Q) 

Second/cm 
(c'/IO®  Q) 

Ohm  (Q) 

Capacitance 

Statfarad 
(lOVc'  F) 

Abfarad 
(10’  F) 

cm 

(lOVc'  F) 

Farad  (F) 

Magnetic 

Flux 

Statvolt-second 
(c/10"  Wb) 

Maxwell 
(10  ® Wb) 

Maxwell 
(10-®  Wb) 

Weber  (Wb) 

Magnetic 

Induction 

Statvolt-sec/cm' 
(c/1  o'  Wb/m'^ 

Gauss 

(10^) 

Gauss 

(10-^) 

Tesla  (Wb/m'^ 

Magnetic 

Field 

Statamp-turn/cm 
(1  0V4tic  A-turn/m) 

Oersted 

(1  0V4ti:  A-turn/m) 

Oersted 

(1  0V4ti:  A-turn/m) 

Amp-turn/m 

Magnetization 

Statvolt-sec/cm' 
(c/4tix1  o'  Wb/m') 

Gauss 

(10  V4ti  Wb/m') 

Gauss 

(10-V471  Wb/m" 

Wb/m' 

Inductance 

Stathenry 
(c'/lO"  Hy) 

Abhenry 
(10  ’ Hy) 

Stathenry? 
(c'/IO®  Hy) 

Henry  (Hy) 

Permittivity 

8 (so=l) 

sc'  (So=1/c') 

£ (So=1) 

s/sq 

Permeability 

Kic'  (Kio=l/c^) 

1^  (^0=0 

1^  (^0=0 
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Table  A-8.  Conversion  factors  for  variables  in  Gaussian  and  SI  units  systems. 


VARIABLE 

VELOCITY  OF  LIGHT 
ELECTRIC  FIELD 
ELECTRIC  POTENTIAL 
VOLTAGE 

ELECTRIC  DISPLACEMENT 
CHARGE 

CHARGE  DENSITY 
CURRENT 
CURRENT  DENSITY 
ELECTRIC  POLARIZATION 
AAAGNETIC  INDUCTION 
MAGNETIC  FIELD 
AAAGNETIZATION 
CONDUCTIVITY 

DIELECTRIC  CONSTANT  (PERMITTIVITY) 

PERMEABILITY 

RESISTANCE 

IMPEDANCE 

INDUCTANCE 

CAPACITANCE 


GAUSSIAN  SI 


c 

E 

(4;tsS"e 

O 

V 

{47ts,Y'v 

b 

{47is^Y^b 

q 

ql(47isbj'^^ 

p 

pl(47Tsb)^'^ 

I 

//(4;r^o)" 

j 

J / {4716qY^ 

p 

PI(471s^Y^ 

B 

{4nl  pb)'^B 

H 

[4kp,Y"h 

M 

(//q  / 4ttY^  M 

a 

a ! {4  kSq) 

s 

s! 

P 

p!  p^ 

R 

(4tis^)R 

Z 

(4;rs,)Z 

L 

(4;rSo)L 

C 

C / {47T£q) 
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Natural  Units 


In  specialized  fields  (like  relativistic  quantum  theory  or  atomic  physics)  there  are  often 
certain  quantities  that  are  “natural”  choices  for  units.  For  example,  consider  a warehouse.  In  this 
warehouse  all  materials  are  stored  on  pallets.  Shipments  involve  receiving  or  sending  of  complete 
pallets.  As  inventories  shift,  complete  pallets  are  moved  from  location  to  location  to  maintain  a 
logical  and  efficient  storage  system.  In  such  a warehouse,  the  logical  unit  is  a “pallet”.  When  a 
worker  says  he  just  relocated  37  Commodity  X from  site  A to  site  D,  the  “natural  unit”  of  “pallet” 
can  be  assumed  without  its  explicit  inclusion.  This  type  of  thinking  has  occurred  numerous  times 
in  various  fields  of  physics. 

In  relativistic  quantum  theory  [3]  it  is  natural  to  use  velocities  in  units  of  c (the  speed  of 
light)  and  actions  (energy  times  time)  in  units  of  h (Planck’s  constant  divided  by  2n).  Because  of 

this,  it  is  not  uncommon  for  relativistic  quantum  theorists  to  set  c = /i  = 1 in  equations.  Use  of  these 

natural  units  leads  to  considerable  simplification  of  the  equations.  All  computations  can  be 
performed  in  natural  units  and  at  the  very  end,  if  it  is  desired  to  express  the  results  in  SI  units, 
powers  of  h and  c can  be  added  back  into  the  final  equation  to  make  the  dimensions  correct  (see  the 
section  on  dimensional  analysis  later  in  this  appendix). 

In  atomic  physics  [4],  it  is  common  to  use  atomic  units,  a set  of  natural  units  introduced  by 
Hartree  as  a modification  of  the  Gaussian  system  of  units.  [5]  In  atomic  units,  e = ^ = = 1 in  all 

equations.  In  atomic  units,  the  unit  of  charge  must  be  the  electronic  charge  and  the  unit  of  mass 
must  be  the  electron  mass.  The  unit  of  length  will  become  the  Bohr  radius 

The  unit  of  energy  is  twice  the  ionization  potential  of  the  hydrogen  atom  = The  unit  of 

velocity  is  the  electron  velocity  in  the  first  Bohr  orbit  = e^/ti  = ac  = the  fine  structure  constant  times 
the  speed  of  light.  The  unit  of  time  = and  so  on.  Once  again,  at  the  end  of  all  calculations, 

powers  of  fi,  m^,  and  e can  be  added  back  in  based  on  dimensional  analysis. 

Other  natural  unit  systems  may  be  encountered.  People  working  on  unified  field  theories  or 
quantum  gravity  might  use  units  in  which  G = h = c=  1 . The  unit  of  mass  would  then  become  the 

Planck  mass,  the  unit  of  length  would  become  the  Planck  length,  and  the  unit  of  time  would  become 
the  Planck  time  (see  Table  A-  for  definition  of  the  Planck  units).  It  is  even  conceivable  that 
someone  might  sete  = h = m^  = c=\.  The  author  remembers  hearing  a discussion  of  this  possibility 

as  a student,  but  does  not  remember  any  details  and  has  never  seen  it  successfully  used.  It  is 
probably  apocryphal,  because  c = 1 would  require  the  unit  of  velocity  to  be  the  speed  of  light,  while 

Q = fi  = m^=  \ requires  the  unit  of  velocity  to  be  ac,  an  inconsistency.  Setting  too  many  fundamental 
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constants  equal  to  unity  will  result  in  such  inconsistencies. 
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Decibels  and  Logarithmic  Units 

The  level  of  a field  quantity  F (such  as  electric  field  strength,  voltage,  pressure,  etc.)  is 
defined  by  the  relation[6] 

Lp=\n(FIF^)  (A.la) 

where  Fg  is  a reference  amplitude.  The  level  of  a power  quantity  P (such  as  power,  intensity,  etc.) 
is  defined  by  the  relation 

Lp  - 0.5ln[P / Pq)  (A.lb) 

where  Pg  is  a reference  power.  Since  power  quantities  are  usually  the  squares  of  associated  field 
quantities,  i.e.,  P = F\  then  the  levels  are  related  by 

Zp  =0.51n(P/Po)  = 0.51n(F"/Fo")  = ln(F/Fo)  = Z^.  (A.2) 

The  definitions  have  been  arranged  such  that  the  levels  of  the  field  quantities  and  their  associated 
powers  are  equal.  When  expressed  as  a level,  there  is  no  need  to  distinguish  between  fields  and 
powers. 

One  neper  (Np)  is  the  level  of  a field  quantity  when  F/Fg  = e,  that  is,  when  ln(F/Fg)  = L 
Thus,  we  see  that  the  fundamental  unit  of  level  for  field  quantities  is  nepers.  The  word  neper  is 
derived  from  John  Napier’s  (the  inventor  of  the  logarithm)  spelling  of  his  own  name.  By  examina- 
tion of  Eqs.  (A.  1)  and  (A.2)  we  see  that  one  neper  must  be  the  level  of  a power  quantity  when  P/Pg 
= F,  that  is,  when  0.5  ln(P/Po)  = F The  level  equations  Eq.  (A.l)  have  units  of  nepers 


Lp=\n(F!F,)  Np 

(A.3a) 

and 

Zp  =0.51n(Z/Po)  Np. 

(A.3b) 

One  bel  (B)  is  the  level  of  a field  quantity  when  F/Fg  = that  is,  when  2 \og{F/Ff^  = L 

In  the  same  vein,  one  bel  is  the  level  of  power  quantity  when  P/Pg 
1. 

In  units  of  bels,  the  level  equations  become 

= 10,  that  is,  when  \og{P/P^  = 

if  =21og(F/F„)  B 

(A.4a) 

and 

if  = log(F/F„)  B. 

(A.4b) 
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Note  that  the  bel  is  not  simply  the  base- 10  logarithm  equivalent  of  the  base-e  unit  called  the  neper. 
The  neper  is  naturally  defined  with  respect  to  the  field  quantities,  while  the  bel  is  naturally  defined 
with  respect  to  the  power  quantities.  Both  bel  and  neper  differ  from  most  other  units  in  the  respect 
that  they  are  logarithmic  units  as  opposed  to  arithmetic  units. 

Logarithmic  units  are  convenient  when  measured  values  can  vary  by  many  orders  of 
magnitude.  As  we  will  see  below,  the  lowest  sound  pressure  detectable  by  human  hearing  is  roughly 
1 pPa  (=  0 dB);  hearing  damage  can  begin  to  occur  at  extended  exposures  to  sound  pressures  even 
lower  than  0.1  Pa  = 10^  pPa  = 100  dB;  death  will  occur  at  sound  levels  of  around  100  kPa  = 10" 
pPa  = 220  dB.  Given  that  interesting  sound  pressure  levels  vary  over  at  least  1 1 orders  of  magnitude 
when  described  in  the  linear  units  of  Pascals,  a logarithmic  unit  such  as  the  decibel  has  the  potential 
benefit  of  only  varying  from  0 dB  to  220  dB). 

In  the  acoustics  community,  the  decibel  (dB)  is  the  universally  used  and  accepted  unit  of 
sound  intensity  I or  sound  pressure  p.  Technically,  both  I and p are  levels  (i.e.,  I = sound  intensity 
level  and  p = sound  pressure  level).  However,  in  common  usage  the  word  level  is  often  omitted. 
One  decibel  is  one-tenth  of  a bel.  The  use  of  decibel  rather  than  bel  undoubtedly  arose  from 
practical  considerations.  Early  measurement  accuracies  were  likely  of  the  order  of  1 decibel.  Thus, 
data  could  be  reported  in  integer  decibels.  The  use  of  bels  would  have  lead  to  many  fractional 
numbers  and  more  complex  record-keeping. 

The  acoustic  units  are  thus  defined  by  the  relations  [7] 


/(in  dB)  = 101og[///o] 

(A.5a) 

p{m  dB)  = 201og[/7(in  //Pa)  / /?q] 

(A.5b) 

where  Ig  is  a reference  intensity  that  is  produced  by  a reference  sound  pressure pg.  Acoustic  intensity 
is  proportional  to  the  square  of  acoustic  pressure.  In  atmospheric  acoustics,  the  reference  pressure 
is  Pg  = 20  pPa;  in  underwater  acoustics,  the  reference  pressure  is/^^  = 1 pPa.  The  20  pPa  reference 
level  was  chosen  because  it  is  the  lowest  sound  pressure  that  can  be  perceived  by  the  average  young 
adult  in  a perfectly  quiet  environment.  All  audible  sounds  will  have  positive  levels.  Sounds  with 
negative  levels  will  never  be  audible.  Since  audibility  is  seldom  a consideration  in  underwater 
sound  measurements  (humans  were  not  designed  to  hear  well  underwater)  and  given  that  the 
acoustic  impedance  of  sea  water  is  considerably  different  than  air,  it  was  reasonable  to  assume  a 
more  obvious  standard  (i.e.,  some  power  of  ten  relative  to  the  SI  unit  of  pressure,  the  Pascal). 

Other  applications  have  found  logarithmic  units  to  be  valuable,  such  as  the  Richter  scale  of 
earthquake  magnitudes.  However,  rather  than  developing  a new  and  unique  name,  many  of  these 
other  applications  have  adopted  the  term  decibels.  In  this  more  general  application, 

X(in  dB)  = 1 0 log[X  / ] (A.6) 
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where  Xfl  is  a reference  level.  Carrier-to-Noise  Ratio  = Carrier  Power/Noise  Power  = CNR  is  almost 
universally  given  in  units  of  decibels  (the  noise  power  acts  as  a convenient  reference  level).  Note 
that  if  two  quantities  are  related  by  one  being  the  square  of  the  other,  the  squared  quantity  (e.g., 
power  or  sound  intensity)  has  a multiplier  10  in  Eq.  (A.5a)  while  the  other  quantity  (e.g.,  voltage  or 
sound  pressure)  has  a multiplier  of  20.  Refer  back  to  Eq.  (A.4a  & b) 

In  some  instances  Xg  is  set  equal  to  1 (dimensionless).  In  this  case  then  the  decibel  value  of 
X carries  an  explicit  unit.  For  example,  in  microwave  transmission  systems  it  is  convenient  to 
discuss  microwave  power  level  in  units  of  dBW,  that  is, 

/^(indBW)  =101og[P(in  W)].  (A.7) 

The  decibel  units  may  become  fairly  complex  as  in  the  case  of  electromagnetic  intensity 

0(in  dBJ  / - s)  = 101og[P(in  J / - s)] . (A.8) 

The  special  unit  dBi  is  sometimes  used  in  discussions  of  antenna  performance.  In  this  case  the  “i” 
refers  to  “isotropic”.  An  antenna  with  a gain  in  a particular  direction  of  -20  dBi  has  a gain  that  is 
20  dB  below  that  of  an  isotropic  radiator  (equally  intensity  emitted  in  all  directions)  at  the  same 
wavelength. 

The  choice  between  using  decibel  (logarithmic)  or  linear  units  may  depend  on  something  as 
simple  as  which  type  of  units  is  easier  to  graph.  For  example,  exponential  curves  plot  as  straight 
lines  when  one  of  the  units  is  logarithmic.  One  could  plot  the  exponential  using  a logarithmic  scale 
(semi-log  paper)  or  one  could  plot  the  exponential  on  linear  scaled  paper  with  logarithmic  units.  If 
the  plot  covers  more  than  5 orders  of  magnitude  the  decibel  plot  will  be  much  less  cluttered  with 
grid  lines  than  the  equivalent  semi-log  plot. 

When  using  logarithmic  units,  one  must  be  careful  not  to  improperly  mix  logarithmic  and 
linear  quantities.  For  example,  the  expression  for  the  carrier-to-noise  ratio  of  a radar  may  be  written 
in  the  linear  form 

CNR^aPD^  a ! (A.9) 

where  P is  in  W,  D is  in  m,  cris  in  m^,  R is  in  m,  and  a is  a collection  of  constants  and  factors  that 
make  CNR  dimensionless.  Alternately,  it  may  be  written  in  logarithmic  form 

1 0 log[CA^i?]  = 1 0 log[a  PD^  (7 ! R^]  (A.  10) 

or 
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(A.11) 


CNRiin  dB)  = lOlog^?  + P(in  dBW)  + 401og(D  in  m) 

+ cr(in  dBsm)  - 401og(i?  in  m) 

where  several  of  the  units  have  been  expressed  in  logarithmic  form  (decibel  form).  In  particular  the 
radar  cross  section  is  expressed  in  dBsm  (decibel  square  meters  — the  radar  cross  section  in  decibels 
relative  to  1 square  meter).  CNR  (in  dB)  must  be  converted  to  CNR  (linear)  before  it  may  be  used 
in  equation  (A.9).  P and  crmust  be  similarly  converted.  It  is  common  for  some  parameters  used  in 
a calculation  to  be  expressed  logarithmically  and  others  to  be  expressed  linearly.  It  is  essentially 
that  one  form  or  the  other  must  be  chosen  before  computations  are  made,  and  all  parameters  must 
be  converted  to  the  chosen  form.  Failure  to  do  this  is  an  exceedingly  common,  easily  understand- 
able, and  absolutely  inexcusable  error. 

Given  the  difficulties  that  logarithmic  units  present,  one  may  wonder  why  we  bother  with 
them.  The  answer  is  tradition.  In  the  not  so  distant  past  (roughly  1975),  before  computers  and 
scientific  calculators  became  commonplace,  the  logarithmic  form  was  often  more  easily  calculated 
(sums  and  differences  are  intrinsically  easier  than  multiplications  and  divisions).  Much  of  our 
current  use  of  decibel  units  is  a holdover  from  the  much  earlier  “necessity”  to  use  the  logarithmic 
equations  rather  than  the  linear  equations.  In  some  fields,  key  equations  (those  often  used  to 
perform  computations)  are  still  given  in  logarithmic  form  and  commonly  used  graphs  are  plotted  in 
logarithmic  units,  and  all  of  the  experienced  practitioners  are  still  fluent  in  the  use  of  the  logarithmic 
forms.  Since  junior  personnel  often  emulate  successful  older  practitioners,  the  junior  people  keep 
the  tradition  alive. 

Furthermore,  there  is  often  a need  to  do  quick  and  dirty  calculations.  These  are  usually 
referred  to  as  “back-of-the-envelope”  calculations.  Unfortunately,  the  designer  sometimes  does  not 
have  even  an  envelope  handy.  Calculations  using  the  add-and-subtract  logarithmic  form  of  the 
equation  and  quick  decibel  approximations  to  the  numbers  can  often  be  performed  quickly  in  one’s 
head,  and  may  yield  answers  of  acceptable  accuracy.  If  one  remembers  that 

1 =0dB 

1.25  = 1 dB 

1.6  =2dB 

2 =3dB 

2.5  = 4 dB 

e = 4.343  dB 

n =5dB 

4 =6dB 

5 =7dB 

6.25  = 8 dB 

8 =9dB 

10  =10dB 

to  good  accuracy,  then  mental  computations  are  considerably  simplified.  Although  the  need  to  have 
logarithmic  units  has  almost  disappeared,  it  is  assured  that  system  designers  will  continue  to 
encounter  them  for  a long  time  to  come. 
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Units  of  Attenuation 


Many  physical  quantities  lose  strength  (attenuate)  as  they  pass  through  material  media.  In 
many  instances,  the  attenuation  as  a function  of  distance  traveled  (R)  takes  an  exponential  form 

X(R)  = . (A.12) 

where  oris  the  attenuation  coefficient.  Rearranging  equation  (A. 8)  and  taking  the  natural  logarithm 
of  both  sides  yields 

a7?  = -ln[x(i?)/X(0)].  (A.13) 

This  bears  a resemblance  to  the  definition  of  a level.  However,  we  should  first  remark  that  Ais  more 
often  a power  quantity  rather  than  a field  quantity.  If  A is  a power  quantity,  then  we  observe 

aR  = - ln[  A(i?)  / A(0)1  = -2(0.5  \n[x(R)  / A(0)1) 

(A.  14) 

= -2([V(fi)/V(0)]  in  Np) 

In  fact  the  quantity  0.5  aR  is  assigned  the  dimensionless  unit  of  neper.  One  neper  implies  an 
attenuation  in  A of  a factor  of  1/e^.  Since  R has  units  of  length,  the  attenuation  coefficient  a must 
have  units  of  nepers/unit  length.  Multiplying  a by  the  distance  traveled  R give  the  total  number  of 
nepers  (powers  of  1/e^)  of  attenuation  the  quantity  A has  undergone.  In  practice,  the  term  neper  is 
almost  never  used  anymore  and  attenuation  coefficients  are  given  units  of  inverse  distance  (e.g.,  a 
might  have  units  of  km  ').  The  neper  and  the  hidden  factor  of  two  are  quietly  assumed.  Since  the 
attenuation  coefficient  always  multiplies  a distance  and  always  goes  into  an  exponential,  this  usually 
causes  no  problems. 

Unfortunately,  in  fields  where  decibels  are  commonly  used,  attenuation  coefficients  are 
commonly  given  in  decibels  per  unit  length  (or  total  decibels  over  a fixed  length).  This  leads  to 
another  common  cause  of  error.  You  can’t  mix  decibels  and  nepers  in  the  same  equation.  Consider 
the  example  of  atmospheric  transmission  of  electromagnetic  radiation  intensity  as  defined  by  the 
exponential  equation 

r = = /(i?)  / /(O)  . (A.14) 

In  decibel  form 

r(in  dB)  = -^(in  dB/km)-7?(in  km)  = 101og|^e~“^  j (A.15) 
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where  we  have  called  A the  attenuation  coefficient  in  decibel  form.  In  neper  form 

r(in  Np)  = 0.5^(111  km'*) •i?(in  km)  = -0.51n|^e““^  j . (A.16) 

If  we  wish  to  express  a in  Np/km  then  we  must  divide  it  by  2. 

«(in  Np  - km'^)  = 0.5d'(in  km'^)  (A.17) 

Note  that  if  the  transmission  equation  is  given  as  a ratio  of  field  quantities,  then  the  factor  of  two 
disappears  and  a(in  Np-km‘)  and  a(in  km‘)  are  identical. 

Since  the  transmission  must  be  the  same  ratio  regardless  of  how  it  is  calculated  we  must  have 
the  equality 


jQ-0.1^(in  dB-km  *)-i?(in  km)  _ ^-«(in  km  ‘)-i?(in  km) 


(A.  18) 


Taking  the  natural  logarithm  of  both  sides  of  this  equation  yields 

ct(inkm')  ct(inkm')  ^ 

0.1  In  10  ~ 0.2303 

for  comparing  the  decibel  and  per  kilometer  forms  of  the  attenuation  coefficient.  Since  attenuation 
coefficients  are  used  as  exponents,  failure  to  convert  to  the  proper  form  for  an  equation  will  lead  to 

significant  errors.  It  is  very  common  for  the  total  attenuation  to  be  expressed  ase  in  a 
computation,  yet  have  a be  expressed  in  dB/km.  In  this  example,  a must  be  converted  to  km  ' form 
before  computing  the  exponential. 


343ct(in  km'  ) (A.19) 
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Dimensional  Analysis 

Dimensional  analysis  is  an  extremely  simple  but  powerful  tool  for  eliminating  errors  creating 
information.  The  fundamental  principles  of  dimensional  analysis  are  that  (a)  the  net  dimensions  of 
one  side  of  an  equation  must  equal  the  net  dimensions  of  the  other  side  of  that  equation,  and  (b)  the 
dimensions  of  any  specific  physical  quantity  (regardless  of  how  it  is  expressed)  must  have  a unique, 
yet  universally  specific  form  when  reduced  to  fundamental  units  of  mass,  length,  time,  and  charge. 
Principle  (b)  deserves  more  elaboration.  Energy  can  be  expressed  or  calculated  in  many  ways: 
kinetic  energy  V2  mv\  gravitational  potential  energy  mgh,  rotational  energy  V2  Io9,  thermal  energy 
kT,  or  electrical  potential  energy  However,  energy  is  energy  and  must  ultimately  have  units  of 
mass  X length^  / time^.  All  five  of  the  preceding  expressions  when  expressed  in  units  of  mass, 
length,  time,  and  charge,  indeed  reduce  to  the  fundamental  form  of  mass  x length^  / time^.  The 
fundamental  forms  of  a number  of  physical  quantities  and  a few  common  physical  constants  are 
summarized  in  Table  A-7.[8]  ft  should  be  noted  that  there  are  a few  older  systems  of  units  that  have 
different  dimensionality  for  the  fundamental  expression.  The  Gaussian  set  of  units  used  in 
electromagnetism  is  one  of  these.  The  primary  differences  occur  only  in  electrical  or  magnetic 
quantities.  Energy  will  always  ultimately  end  up  with  units  of  mP/t^. 

As  mentioned  above,  application  of  these  principles  can  reduce  errors.  Consider  this 
problem.  You  find  an  expression  for  a quantity  you  need  in  an  old  textbook.  You  are  not  sure  this 
book  consistently  uses  SI  units.  The  formula  is  the  Klein-Nishina  formula  for  Compton  scattering. 


/.  2 / \ 

2 

( V v'  ■ 2 

in  cm  / sr  = 

— 

— + Sin  6 

1 y ) 

(A.20) 


The  differential  cross  section  supposedly  is  in  units  of  cm^/sr.  You  do  a quick  dimensional  check 
after  eliminating  the  dimensionless  frequency  terms  in  brackets. 


da 

^ m¥ 


(A.21) 


This  expression  has  units  of  charge  in  it  and  is  clearly  not  reducible  to  units  of  length  squared.  In 
SI  units  many  quantities  with  charge  that  are  not  obviously  electrical  expressions  (like  Ohm’s  law) 
have  the  permittivity  of  free  space  4ti:8o  somewhere  in  the  expression.  Our  expression  does  not. 
Looking  in  Table  we  find  that  permittivity  has  units  of  f q^  / m f . If  we  divide  by  (4;t£’o)^  we  can 
eliminate  all  of  the  charges. 


da  ^ 


(A.22) 


The  dimensionally  correct  expression  becomes 
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(A.23) 


dcr  /. 

in  cm 

dQ^ 


2 


327T^£lm^c^ 


r..'Vry  y>  . A 

1 sin  0 


V 

yyj 


V V V 


) 


The  reader  may  be  interested  to  note  that  this  is  a real-life  example,  and  at  one  time  the  author  did 
need  to  perform  a dimensional  analysis  to  verify  the  proper  form  of  the  equation,  as  there  was  an 
error  in  the  source. 


It  is  much  more  common  to  catch  simple  mistakes  in  exponents,  such  as  expressing  distance 
traveled  by  an  accelerating  object  as 


(A.24) 


A quick  dimensional  check  reveals 


( ' 1 

h)- 

f'l 

vt. 

(A.25) 


which  is  obviously  incorrect  by  one  power  of  time.  The  incorrect  equation  can  be  rapidly  altered 
to  read  correctly 


(A.26) 


Even  more  complicated  exponent  mistakes  can  be  caught  and  diagnosed.  For  example,  we 
may  express  the  electron  plasma  frequency  as 

ne^ 

G)^= (A.27) 

where  n is  the  electron  density  (1'^).  Dimensional  analysis  shows 


1 1 
t (tV/mP)(m) 


The  term  on  the  right-hand  side  has  a dimension  which  is  the  square  of  the  term  on  the  left-hand 
side.  In  this  case  it  is  easy  to  see  that  the  correct  expression  for  electron  plasma  frequency  should 
be 


r 2 

ne 


(A.29) 


These  examples  should  serve  to  convince  the  reader  of  the  power  and  utility  of  dimensional  analysis. 
It  is  a tool  that  is  all  too  infrequently  used,  to  the  detriment  of  everyone. 
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Table  A-7.  Dimensions  of  Physical  Quantities.  The  basic  dimensions 
are  mass  (m),  length  (I),  time  (t),  charge  (q),  and  temperature  (K). 


Physical  Quantity  Symbol 

Dimensions 

Physical  Quantity  Symbol 

Dimensions 

Acceleration 

a 

i/t' 

Magnetic  Flux 

0) 

m P / t q 

Angular  Momentum 

J 

m iVt 

Magnetic  Induction 

B 

m / 1 q 

Angular  Velocity 

CO 

1 /t 

Magnetic  Moment 

P 

P q / 1 

Area 

A 

P 

Magnetization 

M 

q/lt 

Boltzmann  Constant 

k 

m P / t^  T 

Magnetomotive  Force 

q/t 

Capacitance 

C 

t'  qV  m P 

Mass 

m,  M 

m 

Charge 

q 

q 

Moment  of  Inertia 

P 

m P 

Charge  Density 

p 

q/l^ 

Momentum 

P,P 

m 1 / 1 

Conductance 

t 

q V m P 

Momentum  Density 

- 

m / P t 

Conductivity 

a 

t q^  / m 1^ 

Permeability 

P 

m 1 / q^ 

Current 

1 

q /t 

Permittivity 

8 

P q V m P 

Current  Density 

J 

q/Pt 

Planck  Constant 

h,  h 

m P/t 

Density 

P 

m / P 

Polarization 

p 

q / P 

Displacement,  Range 

d,  X,  R 

1 

Power 

p 

m P / P 

Electric  Displacement 

D 

q / P 

Power  Density 

- 

m / 1 1^ 

Electric  Field 

E 

m 1 / t^  q 

Pressure 

p,p 

m/lP 

Electric  Potential, 

V,  d> 

m P / t^  q 

Reluctance 

01 

q^  / m P 

Electromotive  Force 

s 

m P / t^  q 

Resistance 

R 

m P / t q^ 

Energy 

u,  w 

m P / P 

Resistivity 

h/  P 

m P / t q^ 

Energy  Density 

W,  8 

m / 1 P 

Temperature 

T 

T 

Force 

F 

m I/P 

Thermal  Conductivity 

K 

m 1 / 1^ 

Frequency 

f,  V,  CO 

1 /t 

Time 

t 

t 

Gas  Constant 

R 

m P / P T mole 

Torque 

X 

m P / P 

Gravitational  Constant 

G 

P / m P 

Vector  Potential 

A 

m 1 / t q 

Impedance 

Z 

m P / t q^ 

Velocity,  Speed 

V 

l/t 

Impulse 

- 

m 1 / 1 

Viscosity 

r|,  p m 

/It 

Intensity 

1 

m / 1^ 

Volume 

V 

P 

Inductance 

L 

m P / q^ 

Vorticity 

C 

1 /t 

Length 

1 

1 

Work 

w 

m P / P 

Magnetic  Intensity 

H 

q/lt 
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Useful  Physical  Constants 

The  derivations  and  physical  arguments  presented  in  the  text  make  frequent  use  of  numerical 
estimates.  To  facilitate  these  and  other  such  estimates,  in  Table  A-6  we  present  a list  of  fundamental 
physical  constants  (in  SI  units).  The  values  listed  are  from  the  2002  values  [11]  recommended  by 
the  Committee  on  Data  for  Science  and  Technology  of  the  International  Council  of  Scientific  Unions 
(CODATA).  They  are  the  best  self-consistent  set  of  values  available  and  have  replaced  those  from 
the  last  CODATA  least  squares  adjustments  completed  in  1986  [9]  and  1998  [10].  A number  of 
other  useful  constants  extracted  from  the  CRC  Handbook  of  Chemistry  and  Physics  [12]  and  Lang's 
Astrophvsical  Data  [ 1 3]  are  also  included.  In  most  cases,  the  last  digit  given  for  each  experimentally 
determined  constant  is  uncertain. 


Table  A-6.  Useful  physical  constants. 


PHYSICAL  CONSTANT 

SYMBOL 

VALUE 

UNIT 

pi 

71 

3.1415926535897932 

— 

base  of  natural  (Naperian)  logarithms 

e 

2.7182818284590452 

--- 

Euler's  constant 

Y 

0.57721566490153286 

— 

speed  of  light 

c 

2.99792458  X 10® 

m/s 

permittivity  of  free  space 

8.8541 8781 7 X 10  '^ 

F/m 

permeability  of  free  space 

ho 

4ti  X 1 0'^ 

N-s®/C® 

impedance  of  free  space  ([Po/^ol^^^) 

Zo 

376.730313461 

Q. 

Planck  constant 

h 

6.6260693  X 10  ®" 

J-s 

" " (h/27i) 

1.05457168  X 10  ®" 

J-s 

elementary  charge 

e 

1.60217653  X 10  ’’ 

C 

Boltzmann  constant 

k 

1.3806505  X 10  ®® 

J/K 

electron  mass 

me 

9.1093826  X 10  ®’ 

kg 

" mass-energy 

m^c^ 

0.510998918 

MeV 

" magnetic  moment 

he 

-9.28476412  x 10  ®" 

J-r’ 

muon  mass 

1.88353140  X 10  ®® 

kg 

" mass-energy 

105.6583692 

MeV 

" magnetic  moment 

-4.49044799  x 10  ®® 

J-r’ 

tau  mass 

3.16777  X 10  ®® 

kg 

" mass-energy 

1776.99 

MeV 

proton  mass 

mp 

1.67262171  X 10®® 

kg 

" mass-energy 

fTIpC^ 

938.272029 

MeV 

" magnetic  moment 

hp 

1.41060671  x10®®  J-r’ 

neutron  mass 

1.67492728  X 10  ®® 

kg 

" mass-energy 

m„c^ 

939.565360 

MeV 

" magnetic  moment 

hn 

-0.96623645  X 10  ®® 

J-r’ 
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Table  A-6.  (cont.)  Useful  physical  constants. 


PHYSICAL  CONSTANT  SYMBOL  VALUE  UNIT 


deuteron  mass 

" mass-energy 

" magnetic  moment 

helion  mass 

" mass-energy 
" magnetic  moment 
alpha  mass 

" mass-energy 
hydrogen  atom  mass 
atomic  mass  unit 
Avogadro  number 
molar  mass  of  hydrogen  atoms 
fine  structure  constant  (eV28ohc) 

Rydberg  constant  (mgeVSso^h^c) 

II  II 

Bohr  radius  (sohVTtmge^) 

Compton  wavelength  of  electron  (h/mgC) 

" (h/27rm,c) 

classical  electron  radius  (eV47rSomgC^) 
Thomson  cross  section  (87irg^/3) 

Bohr  magneton  (eh/47imj 
nuclear  magneton  (eh/47imp) 
magnetic  flux  quantum  (h/2e) 
quantum  of  circulation  (h/2mj 
Stefan-Boltzmann  constant  (27r®kVl5hV) 
first  radiation  constant  (27ihc^) 
second  radiation  constant  (hc/k) 

Wien  displacement  law  constant, 
ideal  gas  constant 
standard  atmosphere 

molar  volume  of  ideal  gas  (273. 1 5K,  1 atm) 

density  of  air  at  STP  (273.1 5K,  latm) 

Loschmidt  constant  (N^/VJ 

Faraday  constant 

Newtonian  constant  of  gravitation 

standard  acceleration  of  gravity 

Planck  mass  ([hc/27iG]’^^) 

Planck  length  ([hG/27ic^]’^^) 

Planck  time  ([hG/27ic^]’^^) 

Fermi  coupling  constant 
weak  mixing  angle 


3.34358335  X 10-'^ 

kg 

m^c^ 

1875.61282 

MeV 

hd 

0.43307342  x 10'^^ 

J-T" 

5.00641214  X lO'^" 

kg 

m^c^ 

2808.39142 

MeV 

hh 

-1.074553024  X 10-^* 

J-T" 

6.6446565  X 10-'^ 

kg 

3727.37917 

MeV 

mH 

1.6735325  x 10‘^^ 

kg 

u 

1.66053886  X 10'^" 

kg 

Na 

6.0221415  X 10^^ 

mol" 

M(H) 

1.007825034  X 10'^ 

kg 

a 

7.297352568  X 10'^ 

— 

K 

1.0973731568525  X 10" 

m" 

3.289841 960360  X 10'® 

Hz 

°0 

5.291772108  X 10  " 

m 

2.426310238  X 10  " 

m 

K 

3.861 592678  X 10  " 

m 

re 

2.81  7940325  X 10  " 

m 

0.665245873  X 10"® 

m" 

he 

9.27400949  x 10"^ 

J/T 

hN 

5.05078343  X 10"" 

J/T 

% 

2.06783372  x 10  " 

Wb 

3.636947550  X 10'^ 

.-1 

m -s 

a 

5.670400  X 1 0 ® 

W/m"-K^ 

Cl 

3.741  771 38  X 10  " 

W-m" 

C2 

0.014387752 

m-K 

b 

2.8977685  X 10" 

m-K 

R 

8.314472 

J/mol-K 

atm 

1.013250000  X 10® 

Pa 

V. 

22413.996 

cm®/mol 

Pa 

1.285 

kg/m® 

r>o 

2.6867773  X 10"® 

m" 

F 

96485.3383 

C/mol 

G 

6.6742  X 10  " 

m®/kg-s" 

g 

9.80665 

m/s" 

mp 

2.1 7645  X 10  ® 

kg 

Ip 

1.61624  X 10"® 

m 

tp 

5.39121  X 10"^ 

s 

Gp/(^c)^ 

1.16639  X 10  ® 

GeV" 

sin^  0y^ 

0.22215 

— 
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Table  A-6.  (cont.)  Useful  physical  constants. 


PHYSICAL  CONSTANT 

SYMBOL 

VALUE 

UNIT 

Josephson  constant 

Kj-90 

483597.879 

GHz-V' 

Von  Klitzing  constant 

^K-90 

25812.807449 

Q 

astronomical  unit 

AU 

1.4959787061  x 10® 

km 

solar  mass 

Ms 

1.9891  X 10®° 

kg 

earth  mass 

Me 

5.9742  X 10®^ 

kg 

lunar  mass 

M. 

7.3483  X 10®® 

kg 

solar  radius 

Rs 

6.9599  X 10® 

m 

earth  radius,  average 

Re 

6.3710  X 10® 

m 

" " , polar 

6.356755  X 10® 

m 

" " , equatorial 

6.378140  X 10® 

m 

lunar  radius 

R. 

1.7380  X 10® 

m 

earth-sun  distance,  average 

°e 

1.4959787061  x 10® 

km 

" " , aphelion 

Q 

1.5207  X 10® 

km 

" " , perihelion 

q 

1.4707  X 10® 

km 

earth-moon  distance,  average 

Om 

3.8440  X 10® 

km 

" " , apogee 

Q 

4.0674  X 10® 

km 

" " , perigee 

9m 

3.5641  X 10® 

km 

solar  luminosity 

Ls 

3.826  X 10®® 

J/s 

solar  radiation  at  top  of  atmosphere 

lo 

1360.45 

W/m® 

velocity  of  sound  in  dry  air  @ 0°C 

Cs 

331.36 

m/s 

viscosity  of  air  at  288.1 5 K 

h 

1.7894  X 10  ® 

kg/m-s 

density  of  water  @ 3.98°C 

Pw 

1000.000000 

kg/m® 

heat  of  fusion  of  water  @ 0°C 

333.51 

J/kg 

heat  of  vaporization  of  water  @ 1 00°C 

2257.48 

J/kg 

heat  capacity  of  solid  water  @ 298  K 

37.11 

J/K-mol 

heat  capacity  of  liquid  water  @ 298  K 

75.35 

J/K-mol 

heat  capacity  of  water  vapor  @ 298  K 

Cp° 

33.60 

J/K-mol 

Standard  Temperature  and  Pressure  (STP)  is  T = 273.1 5K  and  p = latm. 
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Selected  Conversion  Factors 


Many  units  which  are  not  part  of  the  SI  unit  system  may  be  encountered  in  discussions  of 
sensors  and  weapons  systems.  The  following  list  of  selected  conversion  factors  is  presented  to 
facilitate  comparison  and  assimilation  of  results  from  different  sources.  The  list  is  not  organized 
according  to  any  significant  degree. 

1 atm  = 760Torr  = 101325.0  Pa  = 29.921 2598  in  Hg  = 14.695941  psi 
= 1.01325  X 10*  dyne/cm"  = 1013.25  mbar  = 33.90  ft  HjO  @ 4°C 

1 C = 0.1  nm  = 10  ’°  m 1 Fermi  = 1 0 ’*  m 

1 fathom  = 6 ft  = 1 .82880000  m 

1 mil  = 25.4  pm  = 0.001  in  = 0.0254  mm 

1 m = 39.37007874  in  = 1.093613298  yd 

1 mi  = 1609.3440000  m = 5280  ft  = 1760.0000  yd 

1 nmi  = 1852.0000  m = 6076.1155  ft  = 2025.3718  yd 

1 furlong  = 40  rods  = 220  yards  = 660  feet  = 0.125  mi  (statute)  = 201.168  m 

1 league  (statute)  = 0.86897625  leagues  (naut.  Int.)  = 4.828032  km  = 3 mi  (statute) 

1 light  year  (ly)  = 9.460530  x 1 0’*  m = 63239.74  AU 

1 parsec  (pc)  = 3.085678  x 1 0’*  m = 3.261633  ly 

1 year  ~ 365.25  days  = 8766  hr  = 525960  min  = 31 ,557,600  sec  ~ ti  x 1 0^  sec 
1 kt  = 1 nmi/hr  = 0.51444444  m/s  = 1.150779451  mph 
1 Liter  = 1.056688  qt  = 0.001  m* 

1 gal  (US)  = 4 qt  = 128  fl.oz.  = 3.7854  liters  = 3785.4  cm* 

1 ounce  (US  fluid)  = 29.  5727  ml 

1 ounce  (US  fluid)  = 8 drams  = 2 tablespoons  (tbsp)  = 6 teaspoons  (tsp) 

1 US  barrel  of  petroleum  (bbl)  = 42  gal  = 5.6146  ft*  = 158.983  liters 
1 hogshead  = 63  gal  = 238.47427  I 
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1 bushel  (US)  = 32  dry  qt  = 37.2367  liq  qt  = 35.2381  liters 

1 long  ton  (LT)=  2240  lbs  = 1.12  ton  = 101 6.05  kg  = 1 .01 605  metric  ton  (mT  =tonne) 
1 pound  (lb)  = 16  oz  (Avoirdupois)  = 12  oz  (Troy)  = 256  drams  = 7000  grains 
1 pound  (lb)  = 453.  59237  g 
1 ounce  (Avoirdupois)  = 28.349523  g 

1 carat  = 0.2  g = 3.08647  grains  = 0.00705479  oz  (Avoirdupois) 

1 Newton  = 1 00,000  dynes  1 dyne  = 1 0'^  N 

1 foot-pound  = 1 .35582  J lerg  = 10'^  J 

1 rad  = 57.295780°  1°  = 1 7.45329237  mrad 

1 prad  = 0.20626480"  1"  = 4.848137  prad 

1 cal  (thermochemical)  = 4.1 840000  J = 4.1 840000  x 1 0^  ergs 

= 2.61 1447677  X 10’’ eV 

1 cal  (Int'l.  Steam  Table)  = 4.1868000  J = 4.1868000  x 10^  ergs 

= 2.61 31 95300  X 10”  eV 

1 Btu  (thermochemical)  = 1 054.350  J = 251 .9957  cal  (thermochemical) 

1 Btu  (Int'l  Steam  Table)  = 1 055.05585  J = 251 .9957  cal  (Int'l  Steam  Table) 

1 eV  = 1.60217653  X 10”  J = 2.41  798940  x 10’^  Hz  = 8065.54446  cm  ’ 

1cm  ’ = 2.99792458  X 10’°  Hz  = 1 .239841 90  x 1 0'^  eV 
= 1.98644560  X 10”°  J = 4.74771 893  x 1 0”  kcal 

1 eV/molecule  = 23.0605492  kcal/mol  = 9.64853377  x 10^  J/mol 

1 cm  ’/molecule  = 2.8591 4352  x 1 0 ° kcal/mol  = 1 1 .9626565  J/mol 

1 kcal/mol  = 6.94769460  x 1 0”’ J/molecule 

= 349.755090  cm  ’/molecule  = 0.0433641 01 6 eV/molecule 

kT  (Boltzmann  const,  x Temperature)  = 1 eV,  when  T = 1 .160450476  x lO'’  K 

kT  = 1 cm  ’,  when  T = 1.43877512  K 

1 MeV  = 1.78266180  X 10  °°  kg 

1 megaton  of  TNT  (MT)  = 1 x 1 0’°  cal  = 4.1 84  x 1 0’°  J = 2.61 144757  x 10°°  MeV 
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1 Uranium  or  Plutonium  fission  = 200  MeV  total  energy  released 

= 1 80  MeV  prompt  energy  released 

1 Deuterium-Tritium  fusion  = 1 7.59  MeV  energy  released 

1 MT  = 1 .4508  X 1 0^^  fissions  = 240.9  moles  = 56.61  kg  U-235  fissioned 
1 kg  U-235  fissioned  = 1 7.665  kT 

1 MT  = 1 .48462  X 1 0^^  fusions  = 2465  moles  = 12.33  kg  d-t  fused 
1 kg  d-t  fused  = 81.10  kT 

1 MT  = 0.046553  kg  matter-antimatter  completely  annihilated 
1 kg  Matter-Antimatter  annihilation  = 8.987551 79  x 1 O’*  J = 21.4808  MT 
For  photons:  ^ X E = 1 .239841 90  pm-eV 
1 gauss  = 0.0001  T = 10*  gamma 

1 hp  = 0.7457  kW  = 550  foot-pounds/sec  = 0.706243  Btu/sec 
1 ft-candle  = 10.76391  lux  1 Lambert  = 31 83.099  cd/m^ 

1 ft-L  (foot- Lambert)  = 3.426259  cd/m^ 

1 acre  = 43560  ft"  = 0.001 5625  sq. mi.  = 4046.8564  m"  = 0.40468564  hectare  (ha) 
1 township  = 36  sections  = 36  sq.  mi.  = 23,040  acres  = 93.23957  km" 

1 acre-foot  = 43560  ft"  = 1233.482  m"  = 325,851.43  gal 

°F  = (9/5)°C  -1-32  °C  = (5/9)(°F  - 32) 

1 flick  = 1 W/cm"-sr-pm 

1 ppmv=  10  * P/RT  = 4.4615  X 10  * (273. 15/T)(P/101 325) 

= 1 .20272  X 10  " (P/T)  mole/m" 

1 ppmv  = 1 0 * PM/RT  = 1 .20272  x 1 0'^  MP/T  mg/m" 

1 pgal  = 10  " m-s  " ~ 10  ’"g" 

1 Eotvos  = 1 E = 10’ s"  = 0.1  |igal/m 
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Useful  Mathematical  Expressions  [14],  [15] 


Differentials 

d{uv)  - udv  + vdu 


nu"  ^du 


/ u]  vdu-udv 
d - = — ^ — 

\v  V 


dy  _ dy  du 
dx  du  dx 


d[e“^-  e“du 


d(a“^  - a“  \na  du 


d{\nu)  = 


J(log^  u) 


u\na 


i/(sinM)  - cosM  du 


J(cos  m)  = - sin  u du 


J(tan«)  = sec^  u du 


d (cot  m)  = - csc^  u du 


d (sec  m)  = sec  u tan  u du 


d (esc  m)  = - CSC  u cot  u du 


sin  u\ 


cos  u\  = - 


j(tan  ^ u^  = — 


i/fj  f{t)dt\-f{u)du 


J(sinhM)  = coshw  du 


J(coshM)  = sinhw  du 


j(sinh  ' 


j(cosh  ' m) 


df[t,u,v,---,z)  _ df  dt  df  du  df  dv  df  dz 

ds  dt  ds  du  ds  dv  ds  dz  ds 
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Definite  integrals 


poo 

du 

Jo 


(•00  \ 
p-au  ^ 

a 


^00 

£ =r(«)  = («-!)! 


Jo  a”"'  fl”"' 


•'o  la 


roo  2 2 1 

f duue-“"  = — ^ 
Jo  2a' 


I 


00  ^ 2 2 '\J  71 

duu^  e “ = — ; 

4a' 


poo 

du 

Jo 


2n  -au^ 

u e 


1-3-5  •••  (2a-l)V^ 


n+\  _2/2  + 1 


2”^‘a 


I 


u' 


71 


du 

0 e“-l  15 


poo 

du 

J— 00 


71 


(l  + a'a')(l  + ^'a')  a+b 


1 


nil 

du  sin”  u = 


1- 3-5-7---(a-l)  71 

2-4-6-8---W  ~2 

2- 4-6-8---(a-l) 
l-3-5-7---a 


n = even  integer  {n  ^ 0) 
n = odd  integer  {n  ^ l) 
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Quadratic  equation 

If  ax^  +bx  + c = Q,  then  x = 

Cubic  equation 

If  + py^  +qy  + r ^0 

then  let  a — {3q  — 3 


-b±4h^^^-4^ 


Also  let 


Then  the  solutions  are 


Functions  of  right  triangles 

Pythagorean  theorem 

2 2 2 
+y^  ^ r 

sm.6  - y I r 
COS0-  x!  r 


X = 


2a 

and 

b=( 

and 

27 

B = 

A + B, 

A + B 

1 

1 

2 

2 ^ 

A + B 

2 

2 ^ 

\-b 


1^  ^ 
4 ^27 


sec^  - rlx  - \ I cos  6 


CSC 6 - r/y  - 1/sin^ 
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tan  6 = X ! y 


sin6' 

cos^ 


Laws  of  sines  and  cosines 


a _ b _ c 
sin  A sin  B sin  C 


= diameter  of  eireumscribed  eirele 
(See  below) 

-b^  +c^  - IbccosA 
b^  - +c^  - laccosB 

+b^  - lab  cos  C 
a -bcosC  + ccosB 
c-  acosB  + bcosA 


cot  6 = y ! X = \ ! tan  6 


b - a cosC  + ccosvd 


sin  A = —^s[s  - a)[s  - b)[s  - c)  where  s = [a  + b + c)/2 


Trigonometric  identities 

sin^  (9  + cos^  = 1 sec^  - tan^  6>  = 1 csc^  - cot^  = 1 

sin(a  ± 0)-  sin  a cos P ± cos  a sin  p 

cos(a  ± P)-  cos  a cos  P + sina  sin  p 

tan  a ± tan  P 
1 + tan  a tan  p 

sin  2^  = 2 sin  ^ cos  ^ 


tan(a  + P) 
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cos2^  - cos^  6 - sin^  6-1  cos^  ^-1  = 1-2  sin^  6 


tan  2^  = 


2tan^ 

1 - tan^  0 


sin  36>  = 3 sin  - 4 sin^  6 


cos36>  = 4cos^  6 - 3cos6> 


tan  W - 


3 tan  6 - tan^  6 
1-3  tan^  6 


sin4^  = 8sin^  cos^  ^ - 4 sin  ^ cos  ^ 


cos4^  = Scos"^  ^ - 8cos^^  + 1 


sin  <2  + sin  = 2 sin  0.5(<2  + 0)  ■ cos  0.5(a  - 0) 
sin  <2  - sin  = 2 cos  0.5(a  + y^)  • sin  0.5(<2  - 0) 
cos  <2  + cos  P = 1 cos  0.5(a  + 0)  • cos  0.5(<2  - 0) 
cos  a - cos  P = -l  sin  0.5(a  + 0)  • sin  0.5(<2  - 0) 


tan  a + tan  0 = 


tan  a - tan  0 = 


sin(a  + y^) 
cos  a cos  y^ 

sin(a-y^) 

cosacosy^ 


sin  a^m0-  0.5[cos(a  + y^)  - cos(ci'  - y^)] 
cos  a cos  0 = 0.5[cos(<2  + 0)  + cos(<2  - y0)] 
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sin  a cos  j3  = 0.5[sin(a  + 0)  + sin(a  - 0)^ 
cos  <2  sin  P - 0.5[sin(a  + 0)-  sin(a  - 0)^ 

e""  - 

sinh  X cosh  x 

2 

tanh  X = com  x = 

e +e 

sech  X = — — csch  x ^ 

coshx 

e~'^  - cos6±i sin^ 


Taylor  series  expansion 

/(x  + Xo)  = /(xo)  + /'(xo)x  + /"(xo)^  + /"'(xo) 
Expansions 

(l±x)  ^ = 1 + X + X^  +x^+-" 


(i+xy^^=i+ 


X 

2 


8 16 


(1  + x) 


, X 3x^ 

= 1 + 

2 8 


5x^ 

1_  . . . 

16 


(l  + x)  * = 1 + X + X^  + X^  + x"*  + X^  H 

(l  ± x)  ^ = 1 + 2x  + 3x^  + 4x^  + 5x"^  + 6x^  H — 


e""  +e  "" 

’ 2 

1 

tanhx 

1 

sinhx 


1_  . . . 

3! 
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(l  + x)  =l±nx  + -^^ ’-x^ 

^ ’ 2!  3! 


+ 


/ w(w  + 1)  ■j  w(w  + l)(w  + 2)  T 

(1  + x)  "=\  + nx+  ’-x^  + 


2! 


3! 


/ . \n  « W 1^  ^_2  2 

(•^  — J’)  “ + Yj-^  y + + 


2 3 

X X 


C — 1 + X H 1 h 

2!  3! 


2! 


3! 


^ X- 1 1 

Inx  = 1 — 

X 2 


^x-O^ 


y X j 


+■ 


^x-O^ 


V X ; 


+ 


Inx  = (x-  l)--^(x- 1)^  +^(x- 1)^ 


1 

X > — 

2 

0 < X < 2 


Inx  = 2 


( x~n 

VX  + ly 


1 

+ - 


rx-n'  1 


3 Vx  + ly 


ln(i?  + x)  = Ina  + 2 


X 


A 


+ - 
5 

1 


VX  + ly 


+ 


2fl  + x 


+ - 


X 


A' 


1 

+ - 


3v2fl  + xy  5\2a  + x) 


X 


X > 0 
A' 

+ 


where  a>0,  -a  <x<  +go 


«!  = «x(«-l)x-"x2xl  = e for  large  n 


x^  x^ 

sinx  = X + 

3!  5! 
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+ ... 


X + • • • 

6-7 

1-3-5  V 

■ X + 

2-4-6-7 

x'<l 

X > 1 

X < -1 


(x^  < ;rV4) 


n 


k=\ 


k = l 


n{n  + 1)(2«  + 1) 

6 

n\n  + \f 
4 

n{n  + 1)(2«  + l)(3n^  + 3n  - 1) 
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Vector  operators 


50  . 50  . 50  . 

VO  = x + y + z 

dx  dy  dz 


Gradient  operator 


-j  dA^  dA  dA. 
V-A  = — ^ + — - + ■ 


dx  dy  dz 


Divergence  operator 


VxA  = 


dy  dz 


x + 


^ dA^  dA^  ^ 
V 5z  dx  ^ 


y+ 


dx  dy 


( dZ 


V"  = v-v  = 


5"  5"  5 


2 \ 


+ — ^ + 


dx  dy  dz 


Laplacian  operator 


V"(^0)  = "FV^O  + 2VFVO  + OV^'F 


v(v-i)  = + Vx  Vx  i 


V xV  X A = 


V X VO  = 0 


V-(Vxi) 


V(m-v)  - (v-V)m  + (m-V)v  + vxVxu  + uxV 


X V 


Curl  operator 
VVO  = V^O 

v(v-i)- V"Ji 

0 
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Cylindrical  coordinates 
x-r 

y = r sin  (j) 
z = z 


VO  = 


r + 

dr 


1 0O 
r d(j) 


V^(D  = 


iA 

r dr 


ao 

ar 


+ 


1 a'o  a"o 

+ 


r"  a^zi"  az^ 


r dr  r d(j)  dz 


r 

r 


z 

r 


VxA  = 


a 

dr 


a 

a^^ 


a 

az 
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Spherical  coordinates 

X = r co^(!>  sin6> 
y - r sin  (j)  sin  9 
z-r  QosO 


50  ^ 1 50  - 1 50  - 

VO  = r + 6 + — ^ (b 

dr  r d6  rsinO  d(j) 


V^O  = 


dr 


50 

dr 


+ 


1 5 


r^  sin^  d6 


■ 

sin  6' 

d9 


+ 


1 5^0 

r^  sin^  6 d(j)^ 


V 1 , 1 5(sin^^,)  ^ 1 dA^ 

r^  dr  rsinO  dO  rsinO  d<j) 


VxA  = 


d 

dr 


9 


r^  sin  9 r sin  9 


d 

~d9 


t 

r 

d 


rAg  r sin 9 A. 
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Miscellaneous  Formulae 


Circumference  of  circle  (radius  a) 

s = iTia 

Area  of  circle  (radius  a) 

A - Tva^ 

Area  of  an  ellipse  (axes  a,  b) 

A - Trab 

Surface  area  of  sphere  (radius  a) 

A - 47ra^ 

Volume  of  sphere  (radius  a) 

V^47ra^  / 3 

Length  of  chord  of  circle 

s -2a  sin(^/  2) 

(radius  a,  angular  subtense  0,  chord  height  h) 

II 

1 

Area  of  curved  surface  of  a right  cone 
(height  h,  base  radius  a) 

A = na4a^ 

Volume  of  a right  cone 
(height  h,  base  radius  a) 

V 13 

Radius  of  circle  inscribed  in  triangle 

Js[s  -a)[s-  b)[s  - c) 
r = — 

Radius  of  circumscribed  circle 

abc 

-tV  1 

4Js[s  - - b)[s  - c) 

(Triangle  side  lengths  a,  b,  and  c) 

s = [a  + b + c)  / 2 (In  2 equations 

above) 
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APPENDIX  B 


FINITE  DIFFERENCE  AND  FINITE  ELEMENT  TECHNIQUES 


Finite  Difference  Approximation  of  Derivatives 

Many  systems  of  differential  equations  cannot  be  analyzed  to  yield  closed-form  solutions. 
Many  numerical  analysis  techniques  exist  which  allow  the  behavior  of  such  systems  to  be  modeled 
on  a computer  [l]-[5].  Approximation  of  derivatives  by  ratios  of  finite  differences  of  the  variables 
is  one  of  the  more  easily  applied. 

The  basis  of  the  finite  difference  approach  is  found  in  the  definition  of  a derivative.  Given 
a function  X of  variable  t,  the  derivative  ofx  with  respect  to  t is  defined  as  [6] 

dx  ,.  -I- A?)  - x(^) 

— — (B.l) 

dt  Af->o  A? 


Consider  a sequence  of  values  = nAt,  then  for  any  value  i,  there  is  a value  x,  = x(t,).  Each  x,  is  a 
finite  element  of  extent  At  of  the  function  x(t).  The  finite  difference  ratio 


h.i  - h - h At 


(B.2) 


becomes  the  derivative  when  the  increment  At  becomes  infinitesimally  small.  For  nonzero  At,  Eq. 
(B.2)  is  an  approximation  to  the  derivative.  The  smaller  At  is,  the  better  the  approximation. 
Furthermore,  the  mean  value  theorem  states  that  at  least  one  point  tc,  such  that  t,  < t<-  < t,+;,  has  a 
derivative  equal  to  the  value  of  Eq.(B.2),  for  any  At. 

A derivative  approximation 


dx 

dt 


U+\  ~ U 


(B.3) 


is  the  best  approximation  to  the  true  dx/dt  at  t = f + O.SAt,  not  at  t = t;  or  t = f+i.  In  using  the  finite 
difference  approach  to  solving  differential  equations,  all  functions  and  derivatives  must  be  evaluated 
at  the  same  point.  Thus  it  is  more  useful  to  define  the  first  derivative  as 
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dx 

dt 


(B.4) 


which  is  evaluated  at  t = h rather  than  halfway  between  t;  and  t;+i. 

The  second  derivative  may  be  defined  in  terms  of  a difference  of  first  derivatives 


dx 

dx 

d^x  dt 

t.-  +0.5  Af 

dt 

dt^  txt 


M A? 

At 


x,^,-2x,+x,_, 

{Atf 


(B.5) 


Eq.(B.5)  is  properly  evaluated  at  t = f. 
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Finite  Difference  Solution  of  Differential  Equations 

Now  consider  the  following  differential  equation 

A{x,t)^^^  + B(^x,t)  — +C[x,t)  = Q (B.6) 

dt  d t 

The  finite  difference  solution  of  Eq.(B.6)  proceeds  by  substituting  for  the  derivatives  to  obtain 

(A?)  Ar 

If  Xj  and  Xj.i  are  known,  then  it  becomes  a simple  matter  to  solve  for  the  unknown  Xj+j.  When  Xj+j 
is  known,  it  becomes  the  new  x{,  the  old  X;  becomes  the  new  X;.j  and  the  process  of  solving  for  X;+i 
repeats.  This  recursive  process  can  be  repeated  indefinitely.  Once  the  complete  set  of  X;  is  known 
for  the  set  of  k,  the  solution  is  in  hand. 


The  problem  with  the  recursive  solution  Eq.(B.7)  is  that  X;.j  and  X;  must  be  known  to 
determine  Xj+j.  Fortunately,  many  interesting  problems  begin  with  known  initial  values 


and 


x(t^)^D 


(B.8) 


(B.9) 


If  we  shift  the  t-axis  such  that  i=0  coincides  with  t = to,  then 
^(^o)  = ^ = ^o 


(B.IO) 


If  At  is  small  enough,  the  function  x(t)  can  be  approximated  by  straight  line  segment  between  any 
two  neighboring  points.  In  this  limit  the  derivative  can  be  assumed  to  be  symmetric  about  its  center 
point 


dx 

dt 


Ibd  A? 


For  i=0,  we  can  solve  for  x.j 


(B.ll) 
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X , = X.  - A?  — = D-  E At 
dt  , 


With  the  results  (B.IO)  and  (B.12)  the  recursive  solution  process  may  begin. 
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Finite  Element  Computation  of  Definite  Integrals 

Problems  with  continuous  variation  of  some  parameter  can  often  be  solved  by  breaking 
problem  into  finite  chunks  of  space  and/or  time,  treating  each  finite  element  as  a homogeneous 
entity,  and  evaluating  the  interactions  between  discrete  elements  to  determine  the  behavior  of  the 
whole.  As  the  number  of  elements  gets  larger  (and  their  finite  extent  gets  smaller),  the  discrete 
result  approaches  the  continuous  result.  In  its  simplest  form  one  may  use  this  concept  to  compute 
definite  integrals. 

A definite  integral  (in  one  variable)  can  be  considered  to  represent  the  area  under  a segment 
of  a continuous  curve.  If  the  curve  is  broken  into  finite  elements  and  the  area  under  each  element 
is  estimated,  then  the  total  area  is  the  sum  of  the  areas  under  the  finite  elements.  The  simplest 
estimate  of  area  under  a segment  of  curve  is  value  of  the  curve  at  the  midpoint  times  the  width  of 
the  segment.  This  is  a rectangular  approximation.  Other  more  sophisticated  estimates  can  be  used. 
These  are  more  computationally  efficient,  but  in  the  limit  (and  assuming  perfect  computers)  will  not 
give  better  results.  Since  we  are  merely  establishing  a general  technique,  it  is  not  only  satisfactory 
but  preferable  to  stick  with  the  simplest  approximations.  In  this  case,  it  can  be  shown  that  the 
definite  integral  of  a function  can  be  computed  using  the  expression 


b-a 


i=\ 

N 


f 


N y 


a + 


('->2) 


Y 


1=1 


(B.I3) 


As  N gets  larger,  Ar  gets  smaller,  and  the  approximation  to  the  true  value  of  the  definite  integral  gets 
more  accurate.  In  the  limit  as  N the  identity  will  be  valid.  Eq.  (B.13)  is  the  finite  element 
statement  of  the  rectangular  rule  for  numerical  integration.  It  is  readily  generalized  to  multiple 
functions  and  multiple  dimensions  if  necessary. 
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APPENDIX  C 


PROBABILITY  AND  STATISTICS 


Probability  and  statistics  are  tools  that  are  essential  in  the  analysis  of  combat  systems.  In  this 
appendix  we  present  a brief  review  of  the  critical  aspects  of  these  tools.  We  begin  with  combina- 
torial analysis  - the  ways  in  which  groups  of  objects  can  be  ordered  and  arranged. 


Combinatorial  analysis 

Combinatorial  analysis  involves  the  different  possibilities  of  arranging  objects.  The  objects 
to  be  arranged  are  called  elements  and  the  combination  of  these  elements  including  their  order  is 
called  an  arrangement.  Every  possible  arrangement  of  a given  set  of  elements  in  any  order  is  called 
a permutation  of  those  elements.  For  example,  given  elements  a,  b,  and  c then  the  possible 
permutations  are: 

abc,  cab,  bca,  acb,  bac,  cba 

There  are  n!  permutations  of  distinct  elements.  If  some  elements  (/,  k, ...,  and  m)  are  identical,  and 
the  numbers  of  the  identical  elements  are  PpPk,  ■■■,  and  then  the  number  of  permutations  is  given 

by 

N = . (C.l) 


Two  elements  in  an  arrangement  are  said  to  form  an  inversion  if  their  ordering  is  opposite  to  their 
natural  (or  original?)  order.  For  example,  we  think  of  the  order  ab  cd  eas  natural.  If  we  have  an 
arrangement  b ac  ed,  then  the  pairs  of  elements  b and  a and  e and  d form  inversions.  A permuta- 
tion is  said  to  be  an  even  permutation  if  the  number  of  inversions  is  even,  and  an  odd  permutation 
if  the  number  of  inversions  is  odd. 


The  number  of  permutations  of  k different  elements  selected  from  n possible  elements 
without  repetition  of  any  selected  element  is  given  by 


(C.2) 


The  number  of  permutations  of  k elements  selected  from  n possible  elements  with  any  degree  of 
repetition  allowed  is  given  by 
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npir)  ^ 


(C.3) 


In  a combination  the  order  in  which  the  elements  occur  is  not  taken  into  account.  Thus,  ab 
and  ba  represent  permutations  of  the  combination  of  a and  b.  The  number  of  combinations  of  k 
elements  taken  from  n elements  without  repetition  is 


kj  (n-ky.k\ 


(C.4) 


The  numbers  represented  by  the  are  sometimes  called  the  binomial  coefficients  because  the 
binomial  expansion  can  be  written  as 


(X  + = x”  + + • • • + 7 + - 

fi  ^ . 

l!(n-l)! 


n]  n],  n ] n] 

x”+  x”-V  + ---+  x"~'"y'"  + ---+  / 

0 1 \m  \n 


(C.5) 


^ ( ri\ 

k=o\^  J 


The  number  of  combinations  of  k elements  taken  from  n possible  elements  with  any  degree  of 
repetition  is  given  by 


n + k-\ 


(C.6) 
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Probability 

Classical  probability  theory  concerns  itself  with  trials  and  events.  A trial  is  an  observation 
or  experiment  that  is  characterized  by  a set  of  conditions  to  be  satisfied  and  by  being  repeatable. 
An  event  is  the  result  of  a trial  that  can  occur  but  does  not  necessarily  occur.  A certain  event  S is 
one  which  always  occurs.  An  impossible  event  0 is  one  which  never  occurs.  Any  trial  may  yield 
a number  of  events  simultaneously.  For  example,  if  the  outcomes  of  a trial  are  integers,  then  each 
integer  is  an  event,  odd  integers  and  even  integers  are  events,  prime  integers  are  events,  etc. 
Obviously,  if  the  outcome  of  the  trial  is  7,  then  odd  integer,  and  prime  integer  are  also  outcomes, 
and  therefore  events.  Events  are  said  to  be  mutually  exclusive,  if  as  the  result  of  a trial  only  one 
of  them  can  occur.  In  the  integer  example,  odd  integer  and  even  integer  are  mutually  exclusive 
events.  An  event  is  the  sum  C of  multiple  events  A,  B,  ...  if  in  any  trial  any  of  the  events  A,  B,  or 
...  occur.  The  sum  is  denoted  by 

C = Au  Bu---  (C.7) 


An  event  is  product  C of  multiple  events  A,  B,  ...if  in  any  trial  all  of  the  events  A,  B,  and ...  occur. 
The  product  is  denoted  by 

C = AnBn---  (C.8) 


The  classical  definition  of  probability  is:  '‘The  probability  of  an  event  A equals  the  ratio  of 
the  favorable  outcomes  to  the  total  number  of  outcomes,  provided  they  are  equally  likely.''  Thus, 
if  a trial  can  result  in  n equally  likely  events  and  if  m of  these  are  favorable  to  the  occurrence  of  an 
event  E,  then  the  probability  P(E)  of  the  event  E occurring  is 


P(E) 


m 

n 


number  of  favorable  events 
number  of  possible  events 


(C.9) 


The  probability  of  the  sum  of  a number  of  mutually  exclusive  events  (i.e.,  the  probability  that  any 
of  the  events  occurs)  is  equal  to  the  sum  of  the  probabilities  of  these  events. 

P{Ey  + E^  + + Ej^)-  P[Ey)+  P[E 2)+  •••  + P{Ej^)  (C.IO) 


An  unconditional  probability  depends  only  on  the  set  of  conditions  to  be  satisfied  by  the 
trial.  A conditional  probability  depends  on  at  least  one  further  condition.  This  additional 
condition  is  often  the  prior  occurrence  of  some  other  event.  The  conditional  probability  that  an 
event  E occurs  given  that  the  event  F has  already  occurred  is  denoted  by 

= Probability(£'  occurs  given  F has  occurred)  (C.l  1) 
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The  probability  P(EF)  for  the  simultaneous  occurrence  of  two  events  E and  F is  the  product  of  the 
probability  P(E)  of  the  first  event  E and  the  conditional  probability  P(F\E)  that  the  second  event  F 
will  occur  given  that  E has  already  occurred. 

P(EF)^  P(E)-P(F\E)  (C.12) 

If  a set  of  events  F„  where  z = 1,  2, n,  forms  a complete  system,  then  one  and  only  one 
of  these  events  must  always  result  in  any  trial.  If  a set  of  events  F,  forms  a complete  system  and  E 
is  a further  event,  then  the  event  F»F,  for  any  i is  mutually  exclusive  with  any  other  event  E»Fj 
where  j differs  from  i.  Thus,  the  total  probability  (unconditional)  of  the  event  E is  given  by 


p(e)  = Zp{^f,)='^p(f,)p{e\f,)  (C.13) 

i-l  i-\ 

Two  events  are  independent  of  each  other,  if  the  occurrence  (or  non-occurrence)  of  one 
event  does  not  affect  the  occurrence  or  non-occurrence  of  the  second  event.  That  is, 

P(e\F)  = P(E)  and  P(f|F)  = F(F).  (C.14) 

If  a set  of  events  F,  where  z = 1, ...,  zz  are  all  independent,  then  the  probability  that  all  events  of  the 
set  will  occur  in  a trial  is  given  by 

P{E,nE,n-nE,)  = P(E,)-  P{E,)- ■ P{E, ) = fl  P(£, ) (C.15) 


A useful  result  in  probability  theory  is  Bayes’  theorem.  This  theorem  relates  the  conditional 
probability  of  one  event  occurring  if  a second  event  occurs  to  the  inverse  conditional  probability  (the 
probability  of  the  “second”  event  occurring  given  that  the  “first”  event  occurs).  That  is. 


/-(/-|zr)  P[E\F,)P(F.) 

t,P{E\F,)P{F,) 


(C.16) 


Another  useful  result  is  the  probability  that  an  event  occurs  at  least  once  in  N independent  trials. 
If  P(l)  is  the  probability  that  the  event  occurs  in  one  trial,  then 

P„(l)  = l-P(l)  (C.17) 


is  the  probability  that  the  event  does  not  occur,  i.e.,  the  probability  of  a “miss”.  The  probability  of 
N consecutive  “misses”  is 
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(C.18) 


A,(JV)  = [P„(l)]"=[l-P(l)f 

The  probability  that  at  least  one  “hit”  occurs  is  one  minus  the  probability  of  getting  all  “misses”. 
Thus,  the  probability  of  at  least  one  “hit”  in  N trials  is  given  by 


^(^)  = l-^M(V)=l-[l-P(l)f  . 


(C.19) 
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Random  Variables  and  Functions  of  Random  Variables 


A random  variableXis  a variable  which,  in  repeated  trials,  assumes  different  values  x,  each 
of  which  represents  a random  event.  A event  can  be  considered  random  if  its  probability  of 
occurrence  is  not  conditional  on  the  outcome  of  any  earlier  trial.  Random  variables  are  discrete  if 
X can  assume  only  a finite  number  (or  at  most  a countably  infinite  number)  of  values.  A countably 
infinite  quantity  is  one  in  which  given  any  finite  length  interval  of  the  variable’s  range  the  number 
of  possible  values  is  finite.  For  example  the  set  of  integers  is  countably  infinite  (in  any  finite  range 
of  numbers  fromy^  to  there  is  a finite  and  countable  number  of  integers).  Random  variable  are 
continuous  if  X can  assume  all  possible  values  in  a finite  or  infinite  interval. 

For  discrete  random  variables,  the  random  events  x,  are  treated  as  discontinuities.  Only  the 
finite  values  x,  have  meaning.  The  probability  of  each  random  event  P(X  = x)  is  treated  as  the 
magnitude  of  the  discontinuity.  From  a functional  perspective  we  can  assign  a delta  function  to  each 
value  X,  and  a finite  probability  P(x)  to  each  delta  function.  That  is, 

P(x)  = (C.20) 

The  sum  over  all  possible  values  of  x must  of  course  equal  unity. 

^00  ” 

J JxP(x)  = ^P(x.)<^(xJ  = ^P(X  = xJ  = l (C.21) 

i-\  all/ 

For  continuous  random  variables,  the  probability  of  getting  precisely  a value  x is  zero, 
because  there  are  an  infinite  number  of  precise  values  that  could  be  selected.  From  our  earlier  result, 
if  there  is  one  event  which  gives  the  precise  value  x and  an  infinite  number  of  events  that  give  values 
other  thanx  then  the  probability  will  be  l/oo  or  zero.  We  must  therefore  speak  of  the  probability  that 
the  value  of  X lies  within  a small  range  about  x.  We  call  the  function  that  describes  this  probability 
a probability  density  function  f(x).  More  specifically  we  can  ascribe  the  probability  that  X takes 
on  a value  between  x - 0.5dx  and  x + 0.5dx,  where  dx  is  a very  small  interval,  to  the  function  f(x) 

P{x  - 0.5t/x  < X < X + 0.5t/x)  = f{x)dx  (C.22) 

Note  that  a more  rigorous  mathematical  treatment  of  probability  might  replace  the  second  < sign 
with  a < sign.  However,  since  P(x)  = 0 for  any  exact  value,  there  is  no  error  in  extending  the 
expression  to  include  both  ends  of  the  interval. 

It  is  common  to  define  f(x)  dx  using  a unit  interval.  In  this  case  the  dx  is  usually  dropped. 
The  probability  density  function  f(x)  then  has  a value  of  probability  per  unit  interval  (of  X).  When 
multiplied  by  an  interval,  the  probability  density  function  yields  the  probability  that  the  function 
takes  a value  within  that  interval.  The  definite  integral  of  the  probability  density  function  over  a 
larger  interval  yields  the  probability  that  the  variable  lies  within  that  larger  interval,  i.e., 
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\ dx  f (x)  = Probability  that  a > x > b 

Ja 


(C.23) 


Since  f(x)  represents  a probability  per  unit  interval,  if  f(x)  is  integrated  over  the  entire  range  of 
possible  values,  the  total  probability  must  be  unity. 

dx  /(x)  = 1 (C.24) 

A second  useful  function  is  the  definite  integral  of f(x)  from  -oo  to  some  finite  value  x.  This 
integral  denoted  by  F(x)  in  many  texts  is  known  as  the  distribution  function  or  the  cumulative 
probability  function. 


F(x)  = J dt  f[t)~  Probability  that  X < x 


(C.25) 


That  is,  F(x)  is  the  probability  that  the  random  variable  X takes  on  some  value  that  is  less  than  or 
equal  to  x.  The  relationship  between  f(x)  and  F(x)  is  shown  in  Figure  C-1. 


Figure  C-1.  Relationship  between  a probability  density  function  and 
the  corresponding  distribution  function. 


OR 

CUMULATIVE  PROBABILITY 


547 


Combining  Eqs.  (C.24)  and  (C.25),  we  find 


F(oo)  = J dt  f{t)~  Probability  thatX  < 00  = 1 


(C.26) 


That  is,  the  probability  of  getting  any  possible  value  must  be  unity.  As  a consequence,  we  can 
quickly  determine  that 


^00 

1 - F{x)  = J dt  f{t)  = Probability  that  X>x. 


(C.27) 


Since  F{a)  is  the  probability  that  A is  less  than  or  equal  to  a and  F{b)  is  the  probability  that  Ais  less 
than  or  equal  to  b,  then  we  also  find 

F{b)  - F{o)  = [ dx  f{x)-  Probability  that  a<  X <b  . (C.28) 

J a 

The  probability  density  function  is  sometimes  written  in  the  form  fjx)  where  the  subscript 
denotes  either  the  type  of  distribution  or  the  random  variable  to  which  that  distribution  refers.  The 
letters  p and  P are  also  sometimes  used  instead  of f and  F when  describing  density  and  distribution 
functions. 

Certain  properties  of  density  and  distribution  functions  occasionally  prove  useful.  From 
Eq.(C.25)  we  see  that  the  distribution  function  can  be  obtained  by  integrating  the  density  function. 
Conversely,  the  density  function  can  be  obtained  by  differentiating  the  distribution  function. 

f(x)  = (C.29) 

dx 

The  distribution  function  must  have  limits  of 

F(— oo)  — 0 and  ~ ^ (C.30) 

The  distribution  function  must  also  be  a monotonic  increasing  function  of  increasing  x 

F(x2)>F(xj)  for  ^2  > Xj  (C.31) 

If  y is  a function  g(x)  where  x is  a random  variable  with  density  function  fx(x),  then  the  density 
function  ofy  isfy(y)  and  is  given  by  the  expression 
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where  the  x,  are  the  roots  of  the  equation  = g(x)  = g(x). 

A cumulative  probability  curve  is  a plot  of  the  ""Probability  that  Variable  X has  a value  less 
than  or  equal  to  x”  as  a function  of  “x”,  that  is,  a plot  of  the  probability  distribution  function.  When 
X is  at  its  minimum  value, 

P(X^J  = F(X„J  = 0.  (C.33) 

When  X is  at  its  maximum  value, 

P(X^J=F(X„J  = 1.  (C.34) 

When  X is  at  its  median  value, 

= F(X„e,J  = 0.5.  (C.35) 

In  practice  there  are  a number  of  probability  density  and  probability  distribution  functions 
that  are  commonly  encountered.  In  Table  C- 1 we  list  a number  of  useful  density  functions  and  their 
corresponding  distribution  functions  (when  these  are  calculable  - many  density  functions  do  not 
have  simple  corresponding  distribution  functions).  A number  of  special  functions  are  used  in  Table 
C-1.  These  are  defined  below: 


Error  Function 


Heaviside  Step  Function 


Gamma  Function 


Modified  Bessel  Function 


erf(x)  = \dyey 

^7T  •'0 

(C.36) 

fO  X <a 
U{x;a)  = < 

^ ^ [l  x>a 

(C.37) 

r(x)  = £ dyy''~^e~y 

(C.38) 

r ( A _ 1 

oW-  + 2^  ' 2^ -4^  ' 2^ -4^ -6^ 

(C.39) 

Delta  Function 


There  are  several  definitions  of  the  delta  function.  These 
include: 


Table  C-1.  Common  probability  functions. 


DENSITY  FUNCTION 


DISTRIBUTION  FUNCTION 


Normal  (or  Gaussian)  with  mean  = p and  variance  = 


Dirac 


foiRAci^)  ~ ^(-^9  ^Av) 


^DIRAci^)  ~ ^Av) 


Uniform  (Rectangular  pulse)  with  mean  = Xq  + 0.5  Ax  and  variance  = Ax/T2 

Ta  — ^ ^ — I a — fO  V / 


/y(x;Xo,Ax)  = <^  1 


0 X < x„;  X > x„  + Ax 


Xq  < X < Xq  + Ax 


Fu{x) 


X-Xn 


X < Xq 

Xq  < X < Xq  + Ax 

X > Xn  + Ax 


Binomial  (n  + 1 points,  p+q  = l)  with  mean  = np  and  variance  = np(l-p) 


k=0\^J 


pAx=k)= 

K i 


Poisson  with  mean  = a and  variance  = a 


(x)  = Not  evaluated. 


Pp(x  = k) 


Gamma  --  f(x)  is  maximum  for  x=b/c 


/rW 


b -cx 


r(^+i) 


C/(x;  0) 


Fp(x)=  Not  evaluated. 


Rayleigh  Amplitude 


/«W  = 


-x^nF 


t/(x;  0) 


F^(x)=  Not  evaluated. 


Rayleigh  Power  with  mean  = a and  variance  = a^ 


RAYLEIGH 


(*)  = -e-^'“V(*;0) 


RAYLEIGH 


{x)^[\-e~^'^)u{x-  0) 
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Table  C-1  (continued).  Common  probability  functions. 


DENSITY  FUNCTION 


DISTRIBUTION  FUNCTION 


Rice  with  mean  = //  and  = ratio  of  constant  power  to  average  of  random  power 

. / ^ \ + m \ ) 

f RICE  ~ ^ 


Rice  (m  = 1 ) 


f,ucE{’‘)  = ^e-^^'‘‘U{x;0) 


f 1 

Imll  + nL 

X 

V 

Frice{x)=  Not  evaluated. 


Beta  --  f(x)  is  maximum  for  x=b/(b+cj 

/„(x)  = — — ^ — r — — ^x^(l-x)"  0<x<l  Fg(x)=  Not  evaluated. 

^ r(z>  + i)r(c+i)  ^ ^ ^ 


Cauchy 

f cauchy{^)  ~ 

Laplace 


a ! 71 

2 2 

a +x 


f laplace{^)  n,  ^ 


d —a\x\ 

e 

2 


Fcauchy{^)~  Not  evaluated. 


Flaplace(^)=  Not  evaluated. 


Maxwell 


I \ ^ - 

(xj  = , ^ X e 


2 -x^ Ha^ 


MAXWELL  \- ) 3 

a yin 


U(x;  0) 


FmaxwelM^  Not  evaluated. 


Weibull  with  median  value  = 

.b-\ 


W 


f— 1 

f-1 

= Mn2 

\^m) 

\^m) 

^ x'' 


\^mj 


= \-e 


-(ln2)(x/x„ 


Log-normal  with  mean  (of  In  x)  = ju  and  variance  (of  In  x)  = 


f (r)  - ^ 


Fpj^(x)-  Not  evaluated. 


(C.48) 


(C.49) 


(C.50) 


(C.51) 


(C.52) 


(C.53) 


(C.54) 


(C.55) 
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(C.56) 


<j(x; //)  = lim  e 

fO  X ^ // 

[go  X = // 


fO  x<//;x>//  + Ax 

<J(x;//)=  lim  /^(x;//,Ax)  = <^  1 

^^0  — //<x<//  + Ax 

I Ax  J(C.57) 

fO  X 
[oo  X = // 


The  integral  of  any  delta  function  over  all  values  of  x is  1 , showing  that  the  delta  function 
is  a valid  probability  density  function. 


A common  problem  encountered  in  military  applications  is  the  dispersion  of  “shots”  about 
an  aimpoint,  or  an  analog.  Let  the  aimpoint  be  the  origin  of  coordinates.  The  probability  that  a shot 
will  hit  the  target  at  a lateral  position  r = (x^  + is  given  by  a Gaussian  probability  density 
function 


p{r)  = p{x,y) 


(C.58) 


where  p={rj,  ^ is  the  bias  of  the  mean  and  cris  the  dispersion  (i.e.,  the  standard  deviation)  about 
the  mean.  The  normalization  to  this  distribution  is  )~‘  as  compared  to  the  normalization 

as  shown  in  Eq.(C.40).  This  results  because  we  have  a two-dimensional  distribution  (x, 
y)  rather  a one-dimensional  distribution  (x).  Let  us  set  the  bias  to  zero.  Now  the  probability 
distribution  is 


p{r)  = p(x,y) 


- x2+y2  /2cr- 


iTTa 


iTia 


(C.59) 


The  probability  (P(r^))  that  an  individual  shot  lies  within  radius  a of  the  aimpoint  can  be 
determined  from  the  integral 
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(C.60) 


P{r  < a)  = dr  2w  p{r) 


lUG^  •’0 


The  circular  error  probable  is  the  radius  obtained  using  Eq.(C.58)  for  a probability  of  50%.  That 

is, 


P(r  < CEP)  = 0.5  = 1 - 

(C.61) 

CEP  = {2\n2)"^cj=  1.1 774  Icr . 

(C.62) 
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Statistics 


Statistics  are  used  to  describe  random  functions  when  the  probability  density  function  is 
unknown  or  only  approximately  known.  One  of  the  most  useful  statistical  quantities  is  the  mean  or 
the  expected  value  of  the  function.  The  expected  value  of  a random  variable  X can  be  determined 
from  the  probability  density  function  by 


E{x)  = J dx  X f{x). 


(C.63) 


for  a continuous  random  variable.  If  the  random  variable  is  discrete,  the  expected  value  is 

E(x)  = ^ xj(x^ ) = ^ x^p^  {CM) 

all  X,  all  x^ 

where  />,  is  the  probability  that  the  value  x,  would  be  obtained. 

The  expected  value  of  any  function  gfxj  of  that  random  variable  can  similarly  be  found  from 


= J /W 


(C.65) 


The  expected  value  is  the  average  value  that  would  be  obtained  from  a large  number  of  measure- 
ments. As  such  it  can  be  determined  from  the  probability  density  function  or  it  can  be  found  by 
averaging  a large  number  {N)  of  random  measurements  Xj. 


E(x) 


1 ^ 


(C.66) 


The  mean  does  little  to  describe  the  variability  of  the  individual  measurements.  More 
general  statistical  measures  are  the  moments  of  the  distribution.  Moments  may  be  determined  about 
the  origin  or  about  the  mean.  The  m*  moment  about  the  origin  of  the  distribution  f(x)  is  defined  by 
the  relations 


all  Xi 

for  discrete  random  variables  and 


r„  = £(x")  = 


(C.67) 


(C.68) 


554 


for  continuous  random  processes.  The  mean  /j.  is  given  by  the  first  moment  about  the  origin. 

ju  = E(x)  = Vj  (C.69) 

The  m*  moment  about  the  mean  is  defined  by  the  relations 


=e[(x-/2Y)=Y,  ) 

all  X, 


(C.70) 


for  discrete  random  functions  and 


!^m=  e[{x- lLiy^  = \ Jx(x- /uf  f(x) 


(C.71) 


for  continuous  random  processes.  Obviously  //;  = 0. 

The  following  quantities  computed  from  the  various  moments  are  usually  used  to  describe 
the  statistics  of  a quantity.  These  are 


Mean 

= //=  V; 

(C.72) 

Variance  (=  Var(x)) 

= CE  =jU2=  V2-v/ 

(C.73) 

Standard  Deviation 

II 

b 

II 

(C.74) 

Skewness 

II 

II 

(C.75) 

Kurtosis 

= (a,  - 3)/2  = ((/uJ/^2')  - V/2 

(C.76) 

The  mean  describes  the  “center”  of  the  distribution.  The  variance  describes  its  “width.  The 
skewness  describes  the  asymmetry  in  the  distribution.  The  kurtosis  is  essentially  a measure  of  the 
flatness  of  the  peak  and  the  verticality  of  the  sides  of  the  distribution  (usually  referenced  to  a 
Gaussian  distribution). 

The  mean  of  a collection  of  measurements  is  given  by  Eq.  (C.64).  The  variance  of  that  same 
collection  of  measurements  is  given  by 


Var(x) 


]_ 

N 


~N 


N 

(N 

_i=l 

(C.77) 
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The  median  value  of  a distribution  or  of  a collection  of  measurements  is  that  value  above 
which  one  finds  half  of  the  values  and  below  which  one  finds  the  other  half.  The  median  value 
^median  of  ^ distribution  is  that  value  of  x for  which 


F(x„^dian)  = 0-5 

(C.78) 

J dx  f (x)  = J dx  f (x) 

^median 

(C.79) 

If  the  N elements  of  a collection  of  measurements  are  ordered  by  increasing  value,  i.e.,  x„  < x„+;  for 
all  m,  then  the  median  value  is  Xj^/2.  The  mode  of  a distribution  or  collection  of  measurements  is  that 
value  of  X for  which  the  density  function  is  a maximum,  or  the  value  most  commonly  encountered. 
The  mode  is  the  value  that  is  most  likely  to  be  picked  in  a single  trial. 
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Variance  and  Covariance  of  Vector  Random  Functions 


In  the  preceding  section  we  described  a number  of  statistics  of  simple  random  variables. 
Many  problems  involve  more  than  a single  random  variable.  For  example,  position  involves  three 
variables  (x,  y,  and  z)  each  of  which  may  be  random  and  may  or  may  not  vary  independently. 

The  variance  of  a random  variable  X has  been  previously  identified  as  being  given  by  the 
relation 


Var(X)  = e|(X  - E{X)f  | = e{(X  - E(X))(X  - E(X))}  . (C.80) 

The  covariance  of  two  jointly  distributed  random  variables  is  a generalization  of  the  variance  and 
is  defined  by 


Cov(X,  Y)  = e{(X  - E(X))(7-  E(7))}  . (C.81) 

The  covariance  basically  describes  how  one  random  variable  varies  with  respect  to  the  other  random 
variable.  Note  that 

Cov(X,X)  = Var(X).  (C.82) 

It  is  often  of  interest  to  know  the  degree  to  which  variations  in  one  random  variable  track 
variations  in  another  random  variable.  The  correlation  coefficient  r provides  such  a description. 
The  correlation  coefficient  is  defined  by 


Cov(X,7) 
.^Var(X)-Var(7)  ' 


(C.83) 


If  two  random  variables  move  in  perfect  synchronization  with  each  other,  the  correlation  coefficient 
approaches  unity,  r = 1,  and  the  variables  are  said  to  be  correlated.  If  the  changes  in  one  variable 
show  absolutely  no  definable  relationship  to  the  changes  in  the  second  variable,  then  the  correlation 
coefficient  approaches  zero,  r = 0,  and  the  variables  are  said  to  be  uncorrelated.  If  one  variable 
changes  exactly  the  opposite  to  changes  in  the  second  variable,  then  the  correlation  coefficient 
approaches  minus  one,  r = -7 , and  the  variables  are  said  to  be  anti-correlated.  Two  random  variables 
which  have  variations  with  limited  relationship  to  each  other,  that  is,  -1  < r < 0 and  0<r<l,  then 
the  variables  are  partially  correlated.  The  closer  the  correlation  coefficient  approaches  +1  (-1),  the 
more  highly  correlated  (anti-correlated)  are  the  two  random  variables. 

If  the  random  quantity  is  a vector,  then  we  frequently  find  that  we  need  information  on  how 
one  component  of  that  vector  varies  relative  to  other  components.  We  summarize  all  of  this 
information  in  the  expectation  value  and  covariance  of  that  vector.  For  a random  vector 
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X = {X„X„--,^n}. 


(C.84) 


the  expectation  value  is  just  the  vector  composed  of  the  expectation  value  of  each  component 

e{  v}  = {E(  V, ),  E(  V),  • ■ • , E(X« )}  . (C.85) 

The  covariance  of  the  random  vector  isanNxN  matrix  defined  by 

Z =Cov(x„V^j 

ij 

Var(Xj)  Coy{X„X^)  •••  Cov(Xi,X^)' 

Coy{X„X^)  yar{X^)  •••  Cow{X^,X^) 

Coy(X„X^)  Coy(X„X^)  Var(X^) 
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